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ARNOLD DIFFUSION IN NEARLY INTEGRABLE HAMILTONIAN 
SYSTEMS OF ARBITRARY DEGREES OF FREEDOM 


CHONG-QING CHENG AND JINXIN XUE 


Abstract. In this paper Arnold diffusion is proved to be a generic phenomenon 
in nearly integrable convex Hamiltonian systems with arbitrarily many degrees of 
freedom: 

H{x,y) = h{y) + sP{x,y), x£T", t/£ R"', n>3. 

Under typical perturbation eP, the system admits “connecting” orbit that passes 
through any finitely many prescribed small balls in the same energy level H~^(E) 
provided E > minh. 
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1. Introduction 

In this paper, we consider nearly integrable Hamiltonian systems of the form 

(1.1) H{x,y) = h{y)+ eP{x,y), {x,y)GT*T^, n > 3. 

where h is strictly convex, namely, the Hessian matrix is positive definite. It is 
also assumed that both h and P are C"’-function with r > 2n and min/i = 0. 
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The problem of studying the (in) stability of the above system H was considered to 
be the fundamental problem of Hamiltonian dynamics by Poincare. According to the 
celebrated KAM theorem, there exists a large measure Cantor set of Lagrangian tori 
on which the dynamics is conjugate to irrational rotations and the oscillation of the 
slow variable (or called action variable) y is at most 0{^/e). The KAM theorem also 
excludes the possibility of large oscillation of y in the case of n = 2 since each energy 
level, which is three dimensional, is laminated by two dimensional KAM tori and each 
orbit either stays on a KAM torus or is confined between two tori. 

For n > 3, there does not exist topological obstruction for the slow variables y to 
have 0(1) oscillation. Arnold was the first one who had realized such instability |A63j 
and constructed the first example in |A64j half a century ago 

(1.2) H{I, 9,y,x,t) = — + ^ + e{cosx + !)(! + fi{cos6 + sinf)). 


where there are orbits along which the action variable I has as large oscillation as we 
wish. Although the perturbation is far from being typical, Arnold still proposed 


Conjecture l |A66j i: The “general case” for a Hamiltonian system ( |1.1[ ) with n > 3 
is represented by the situation that for an arbitrary pair of neighborhood of tori y = y', 
y = y", in one component of the level set h{y) = h{y') there exists, for suffieiently 
small e, an orbit intersecting both neighborhoods. 


The problem is named Arnold diffusion. In recent years, it has become clear that 
Arnold diffusion is a typical phenomenon in so called a priori unstable systems, which 
are small perturbation of compound pendulum-single rotator system. There are many 
works studying this problem based on two streams of methods: the variational method 
(c.f. [R^ICYTlICY^rnCj i and the geometric method fc.f. [DLSOSlELSHI E] ) • With 
the variational method, the genericity of perturbations was established in |CY1([CTY2| . 
which relies on the existence of a normally hyperbolic invariant cylinder (NHIC) of 
dimension two given by the a priori unstable condition. For the geometric method, 
we highlight the central object, the scattering map, whose symplectic properties are 
studied in details in [DLSOS] . 


Nearly integrable Hamiltonian like (1.1) is also called a priori stable system, where 
a frequency ^{u) = ^ is said to admit a resonance relation, if there exists an integer 
vector k G Z” \ {0} such that (k,a;(?/)) = 0 at some point y. The number of linearly 
independent resonance relations is called the multiplicity of the resonance. A bit away 
from strong multiple resonance in such systems, some pieces of NHIC still exist and 
the method for a priori unstable system can still be applied |Be31 IBKZj . However, 
these pieces of NHIC are separated by small neighborhood of multiple resonances. 
Without studying how to pass through neighborhood of strong multiple resonances, 
it would be impossible to construct orbits which can drift for large scale. 


The problem of Arnold diffusion has a long history, and the mathematical issues 
that arise are subtle and technically involved. Many key ideas can be traced back to 
the early published and unpublished work of Mather (see the announcement |M04j in 
2004 and published errata |M12j in 2012, as well as the paper |Mllj i. Even in the 
case n = 3, there are (as of the date of this paper) no published proofs of diffusion. 
There are, however, several papers on the arxiv claiming the result for n = 3: a paper 
of Cheng was first posted in Jul. 2012 (updated in Mar. 2013, later decomposed into 
|C15alIClSbllCZl] in 2015 and |CZ1) is accepted); between the first and second version 
of Cheng’s paper, Kaloshin and K. Zhang posted a paper |KZ1] on arxiv in Dec. 2012 
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(updated in Jan 2013 and a published announcement |KZ4| h Marco and Sabbagh 
posted partial solutions |MaSa| on arxiv in Jan. 2014 (see also [Me] a conference 
paper in 2015). 

In April of 2014, Kaloshin and Zhang posted on their webpages a 36-page announce¬ 
ment and key elements of a proof of diffusion |KZ2j in the case n = 3.5. Subsequently, 
they posted on the arxiv in October 2014 a longer paper |KZ3j containing details of 
the announced proof as well as partial results and techniques applicable to n > 4. This 
paper bears some important commonalities with part of our paper. We discuss these, 
as well as the crucial new ideas in this paper including issues of crossing complete 
resonance, in the next section. The present paper is the hrst posted complete proof 
of Arnold diffusion for general n > 4. 

1.1. Statement of the main result. By adding a constant to H and introducing 
a translation y ^ y + yo, one can assume min h{y) = h{0) = 0. For E > 0, let 
H~^{E) = {(x, y) : H{x, y) = E} denote the energy level set, B C M"" denote a ball in 
M” such that U£;'<_e-i-i ^ Let &a,^a C C^{T^ x B) denote a sphere and 

a bah with radius a > 0 respectively: F G Sq if and only ||F||cr = a and F G if 
and only ||F||cr- < a. They inherit the topology from C'^(T’^ x B). 

For perturbation P independent of y (for instance, in classical mechanical systems), 
we use the same notation &a, C C'^(T"') to denote a sphere and a bah with radius 
o > 0. 

Let DJa be a set residual in Sq, each P G DJa is associated with a set Rp residual 
in the interval [0,ap] with ap < a. A set (ta is said cusp-residual in iBa if 

€a = {XP : P Rp}- 

Let denote the Hamiltonian flow determined by H. Given an initial value {x,y), 
^}j{x,y) generates an orbit of the Hamiltonian flow {x{t),y{t)). An orbit {x{t),y{t)) 
is said to visit Bs{yo) C M” if there exists t G M such that y{t) G Bs{yo) a ball centered 
at yo with radius 6. 

Theorem 1.1. Given any small J > 0, there exists eg, sueh that given finitely many 
small balls Bfiyi) C M"', where yi G h~^{E) with E > 0, there exists a eusp-residual 
set Cep C C'^(T"’ X B) with r > 2n such that for each eP G Cep, the Hamiltonian flow 
admits orbits which visit the balls Bs{yi) in any prescribed order. Moreover, the 
theorem still holds if we replace the function space C'^(T"’ x B) by 

This is the main result of this paper. Our statement is stronger than the original 
conjecture of Arnold stated above. We establish the existence of J-dense orbits in the 
phase space for any given 5. We have the following remarks. 

(1) Potential or metric perturbations. Our perturbation is generic not only in 
usual sense, but also in the sense of Mane [Mij.i.e. it is a typical phenomenon when the 
system is perturbed by functions depending only on x, called potential perturbation. 
It is interesting to ask the following Riemannian metric version of Arnold diffusion 
(see m for an example). 

Problem: Suppose g = {g^j{x) + efij{x))dx^dxfi x G and i,j = l,...,n, is a 

, r > 2n Riemannian metric on T”, n > 3 where g^j{x)dx'^dx^ is flat. Prove that 
for “generic” f = (fij) in the unit sphere o/(C"’(T”))’^^”, any two small balls in the 
unit tangent bundle are connected by an orbit ( 7 , 7 ), where 'j is a geodesic, for 

sufficiently small e. 
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(2) The cusp-residual condition. Roughly speaking, cusp-residual means that 
for generic P G &i, there is ep depending on P such that (1.1) admits diffusing orbit 
for generic e < ep, but for given e, there is an open set on for eP therein the 
existence of diffusing orbit is not known. Our cusp-residual set has an extra structure 
called multiple filtration described in Section so we call it a multi-filtered cusp- 
residual (MFCR) set. 

(3) The smoothness of the perturbations. The smoothness requirement r > 2n 

matches the least KAM smoothness threshold. Namely, it was shown in [mu that 
KAM theory does not apply if the perturbation is small only in but not C'^"'- 

topology (see Remark 4.1). Our theorem holds also for r = oo, but the analytic 
perturbation case of Arnold diffusion remains open (see [nT 2 ] for a result on a priori 
chaotic systems). On the other hand, for Hamiltonians with low regularity, connecting 
lemma techniques can be applied to get topologically mixing for “most Hamiltonian” 
and “most energy levels” IHFRVj . See also [ABCl IBoCr] for other related resnlts on 
Cl generic symplectomorphisms. Robust transitivity is shown in |NPj to be true for 
an open set of Hamiltonians in C”, r > 2 by introducing a symplectic blender. For 
non Hamiltonian but volume preserving generic diffeomorphisms, it is shown in 
|ACWj that there is a dichotomy, i.e. either zero metric entropy or the volume being 
ergodic. 


This theorem is proved by variational method. The diffusion orbits are constructed 
to move along many pieces of NHIC and to cross multiple-strong resonance 
point (join two pieces of NHIC). 


1.1.1. Outline of our proof. The construction of diffusing orbits involves under¬ 
standing the pictures in four different spaces: the frequency space, the phase space, 
the parameter space (or the cohomology space Ffi(T”,M)) and the function space, of 
which we have a clear understanding summarized as follows. 

(1) The frequency space picture. 

• It is a folklore theorem that Fourier modes in the span^jk} appears in the 
KAM normal form if k is a resonance relation to the frequency uj{y) (Section 
[^. In other words, “resonance produces pendulum”. 

• We show that for any two balls in the frequency space, there is a connected 
curve joining them with a hierarchy structure (Section |4.4| ). Namely, along the 
curve each frequency has at least (n — 2 ) linearly independent resonant integer 
vectors k^, k^,..., k"’"^ G Z” forming a hierarchy |k*| <C |k*+^|, i = 1,, re—3 
except for finitely many points, there are (re — 1 ) linearly independent resonant 
integer vectors k^, k^,..., k”“^ G Z” forming a hierarchy |k®| <C |k*'’'^|, i ^ j, 
for some j = l,2 ,...,re — 2 and Ik-’l is comparable to |k-^'’'^|. 

(2) The phase space picture. 


The orbits always shadow certain Aubry-Mather set for twist maps or subsys¬ 
tems of two degrees of freedom. 

The (un)stable “manifolds” of these Aubry-Mather set do not always need to 
intersect transversally in order to implement Arnold’s mechanism. In some 
circumstance, it is enough to have incomplete intersections (see Theorem 7.1), 
namely they do not split along some but not all directions. 


(3) The parameter space (the cohomology space F7^(T”,]R)) picture. 
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• Based on the complete understanding of the flat of the a function for systems of 

two degrees of freedom in |C12[IC15b] . We get a continuous path in M) 

along which the c-equivalence is always satisfied (Section |6.2[ ). 

• The difficulty of the high dimensionality manifests itself as the channel mis¬ 
alignment problem (see Fig and compare Fig[^. We address it by intro¬ 
ducing a ladder construction (Section |6.4[ ). The corresponding picture in the 
frequency path has a detour of order close to strong double resonance (see 
Fig[§. 

(4) The function space picture. 

• We get a MFCR set using the parametric transversality (PT) results estab¬ 
lished in [CZll ICZ2[ ICYHICY2] , perturbations in which admit diffusing orbit 
(Section]^. 

• We verify the assumptions for the PT results by showing that the weak KAM 
solutions for systems with two degrees of freedom restricted to the energy 
level can be parametrized to have 1/3 Holder regularity with respect to the 
parameter. This is used in the ladder construction. 



Figure 1. The n = 3 case, (red) curves of cohomology equivalence 



Figure 2. The re > 3 case, the pizza and the ladder climbing 

Most of the above outlined work is devoted to understanding the dynamics near 
complete resonance where the frequency R'(y) admits re — 1 resonance relations two of 
which are strong. The main difficulty comes from the nonperturbative nature of the 
dynamics near strong resonance which is intrinsic to the a priori stable systems. Our 
work on crossing the complete resonance may have broader interests beyond Arnold 
diffusion. Away from the strong resonances along the frequency line, NHICs can be 
found so that the problem is reduced to the well understood a priori unstable system. 

We next briefly explain the construction of NHICs approaching the multiple-strong 
resonant points. A general idea is that sufficiently weak resonance can be averaged and 
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treated as small perturbation to those stronger resonance terms. In the earlier papers 
|KZ2l IKZ3| this idea is used, where the “dominant structure” is introduced to find 
a path of NHIC. Independently, we work out this idea in this paper in the following 
way. Inspired by Arnold’s remark in |A66) “it is necessary to examine the transition 
from single to double resonance” we choose the frequency line LOa = Aa(a, 
carefully, where iOn -3 ^ is of Diophantine. When the parameter a varies in an 

interval, we get a path in h~^{E) by choosing suitable Aq. 


Two types of KAM iteration are introduced so that we reduce the system 
(1.1) into a locally defined system with one less degree of freedom for generic 
Hamiltonian. It is guaranteed by a result |DLSn8| saying that a Hamiltonian 
system restricted on the NHIM is still Hamiltonian. Restricted on NHIM, the 
frequency vector turns out to be ui'^ = Aa(a,0, ^,Cjn- 3 Y after transformed by 
unimodular matrix. The zero entry in corresponds to the normal dimension, 
which is eliminated by the reduction. Next, we approximate w *_3 by a vector 
(|,£h*_ 4 ), where i® of Diophantine such that the new line segment oJa = 
Aa(a, |,w*_ 4 )* has the same structure as the uja before. 

Around strong double resonant point, extra action-angle coordinates are in¬ 
troduced (center straightening) so that the reduced system is nearly integrable 
again. Such coordinate transformation is singular only at the double resonant 
point. It allows us to repeat the procedure of reduction in place arbitrarily 
close to the multiple-strong resonance points. 

In our reduction of order, we obtain explicitly Hamiltonian systems of one less 
degrees of freedom and during each reduction of order we have at most two 
resonances to consider. 


Such procedure is repeated for n — 3 steps, certain NHICs are found to reach 
neighborhoods of multiple-strong resonant points with // <C 1. Once NHICs are ob¬ 
tained, we can construct diffusion orbits along these NHICs by the method developed 

in [CYTlEYllEIl]. 

The choice of path for NHIC in |KZ2[ IKZ3| is top-down. They first show that there 
exists “a connected p-dense admissible diffusion path”, by considering all resonant 
integer vectors satisfying certain assumptions, then choose the path implicitly from 
this connected set. Our approach is bottom-up. We first choose our diffusing path 
(frequency line segment) explicitly then determine all its resonance integer vectors 
explicitly. For any given two balls in the frequency space, we find a path connecting 
two rational points in the balls. To see the picture clearly, let us map the frequency 
path to M"’ by setting Aa = 1. Then the two rational points, as two opposite vertices, 
determines a cuboid. Our path connects the two opposite vertices moving along the 
edges of the cuboid and has the hierarchy structure described before. 


1.1.2. Structure of the paper. The paper is organized as follows. In Section to 
get two normal forms corresponding to single or double resonances we design two types 
of KAM iteration by applying a proceeding homogenization procedure, which are used 
to construct two type of reductions of order in Section correspondingly. One is for 
single and weak double resonance, another one is for strong double resonance so that 
we get a Hamiltonian system with one less degrees of freedom. Such construction is 
based on normally hyperbolic invariant manifold theorems and its Hamiltonian ver¬ 
sion following |DLS08] . Special efforts are made for strong double resonance, includes 
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locating of NHICs around the strong double resonance following |C12[ ICZ1| , introduc¬ 
ing new action-angle coordinates and performing shear transformation. In Section 
we show how to perform the procedure of reduction of order of Section inductively 
to get NHICs of two dimensions. We also explain how to connect two p balls in the 
frequency space by frequency lines of the standard form Wq. In Section we give a 
description of the Aubry set along the frequency line segment and around multiple- 
strong resonant points. In Section |6.2t we show how to pass through multiple-strong 
resonance, it involves “ladder climbing” to overcome the difficulty created by the triple 
or more multiple-strong resonance, building up the cohomology equivalence which is 
used to turn around the multiple-strong resonance. In Section we show how to 
construct diffusing orbits variationally. In Section we recapitulate all the genericity 
assumptions that we made and give a proof of the main theorem. In Section we 
establish a Holder regularity result for the parametrized weak KAM solutions, which 
is needed to complete the genericity argument of ladder construction. Finally, we give 
the preliminaries of the Mather theory and weak KAM in the Appendix |A| 


1.2. Convention of notations and terminology. In this section, we fix some 
standing conventions of notations for the rest of the paper. Please refer to Appendix 
[X| for a brief introduction of the Mather theory where more notations are given. 


1.2.1. Legendre transform. The Lagrangian L is related to the Hamiltonian H through 
Legendre transformation H{x,y) = maxx{x,y) — L{x,x). They produce the transfor¬ 
mation .^L- TM ^T*M, M = 

d L 

(x, x) (x, y) = .^l{x, x) = (^x, x)^ . 

Its inverse = -Z//: T*M —)• TM is determined by the Hamiltonian H, 

/ dH \ 

(x, y) (x, x) = .^ h { x , y) = (^x, -^{x, y) j 

Denote T*M —)• T*M the Hamiltonian flow produced by the function H and 
TM ^ TM the Lagrange flow produced by the function L. If L is related to H 
via Legendre transformation, then 

= cfi 

Let ai denote the a-function for the Lagrangian L. If the Lagrangian L is obtained 
from the Hamiltonian H through Legendre transformation, we use an to denote the a- 
function, i.e. an = ol- Similarly, Pl = Ph denotes the ;0-function for the Lagrangian 
L and for the Hamiltonian H respectively. 

Each Tonelli Lagrangian L induces a relation between the first cohomology group 
and the first homology group Mi(M, M) -H- iL^(M, M), called Fenchel-Legendre 

transformation: 

c G (p) aL{c) + /3l{p) = {c,p). 

If a Hamiltonian H is related to a Lagrangian L via Legendre transformation, we can 
also use the notation i.e. 
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1 . 2 . 2 . c-minimal curve and c-minimal orbit. A curve 7 : M —)■ M is called c-minimal 
if for any curve M —)■ M and for any to, ti,t'i G M with t'l = ti mod 1 one has 

[ {L{l{t),i{t),t) - {c,^{t)) + aL)dt < f {L{^{t),i{t),t) - {c,^{t)) + aL)dt, 

Jto Jto 

where the Lagrangian L is assumed time-l-periodic: L{-,t) = L{-,t + 1). If a curve 7 
is c-minimal, then ( 7 , 7 ) is called c-minimal orbit. 

1.2.3. \g-minimal periodic curve and \g-minimal periodic orbit. Consider a Tonelli 
Lagrangian L{x, x) independent of time defined on TM. A periodic curve 7 : [0, A“^] —)■ 
M is associated with a class [ 7 ] G Hi{M,'L). It is called A 5 -minimal periodic curve if 
one has 

/ L{-f{t),^{t))dt= ini / L{^{t),i{t))dt. 

Jo [i]=aJo 

In this case ( 7 , 7 ) (.ifL( 7 , 7 )) is called A^f-minimal orbit. 

Notations We list some conventions and notations. 

• {The vector norm) Our convention of using | • | as follows. 

* It is the usual absolute value when applied to real numbers. 

* It is the ii norm when applied to integer vectors k G Z” which are row 
vectors. 

* It is the £oc norm when applied to a frequency uj G MT which is column 
vector. 

So we can write estimate |(k,a;)| < |k| • |a;|. 

• Let a{i, j) be the permutation matrix, i.e. its {i, j)-entry, (j, i)-entry and {£, £)- 
entry {£ 7 ^ i,j) are equal to 1, all other entries are equal to zero. When it 
applies to an n X n matrix, it is also thought as n x n matrix by default. 

• (The projection operator) Let 7 r_j be an operator, when it applies to a vector, it 

eliminate the i-th entry, for instance, 7 r_ 2 (ui, U 2 , U 3 , • • • , Vn) = {vi,V 3 , - ■ ■ , Vn). 
When it applies to a matrix, it eliminate the i-th row and the i-th column. 
The operator vr+j is introduced by adding 0 to a vector as the i-th entry, for 
instance, 71 + 2 ( 1 ^ 1 , • • • , Vn) = ('Wi, 0, U 3 , • • • , Vn). For a matrix M ^ W , j > 

1, we define 7 r+i(M) G as the enlarged matrix obtained by putting 

zeros as the new i-th row and column except for the (i,i) entry, we put 1 . 

• {the hat notation) We fix the meaning of the hat notation throughout the 
paper. For a vector v G M”, we use Vn-i to denote the vector {vi+i,Vi+ 2 , ■ ■ ■, Vn) 
in for 1 < i < n. 

• {the tilde notation) Dual to the hat notation, we introduce the tilde notation. 
For a vector v G M”, we use Vi to denote the vector {vi,... ,Vi) G M*, 3 < i < n. 
We omit the subscript i if i = 2. 

• {the ' and " notations) We use ' to indicate the single resonance and " for the 
double resonance (see below). We never use them to take derivatives in this 
paper. 

• (o number as super- or subscripts) A vector may carry numeric super- or sub¬ 
scripts. The superscript counts the step of reduction of order and the subscript 
indicates the vector entries. Such a superscript for a vector has no danger of 
confusion with a power. If a matrix or a scalar carries a numeric subscript, 
we fix its meaning as counting the step of reduction of order and its numeric 
super-script is the usual power. 
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2. The normal form 


In this section, we construct the frequency vectors with special number theoretic 
properties and derive Hamiltonian normal forms associated to such frequency vectors. 

Definition 2.1. We say a Diophantine vector v G is in T)C{d,a,T) if we have 

(2.1) |((i,^),k>|>^. kezGio). 

2.1. Number theoretic properties of the frequency line. What we are going to 
prove here will be used in the following KAM iterations. 


2.1.1. Single resonance. Consider frequency vector u)a G M"' of the form 

t 


( 2 . 2 ) 


UJ, 


= \a 


P P ^ 


^ , P,Q,p,q £ Tj, a G M, ^^-3 ^ h)C(n — 2, a, r). 


To determine the parameter Xa, we first think uja as a point in the projective space 
]^]pn-i gQ jg determined up to a nonzero scalar multiple, which does not influence 
the resonance relations. Since we know that y lies on an energy level E and since the 
energy hyper surface h~^{E) encloses a convex set containing the origin, the equation 
{tJa)) = E, u}{y) = dyh{y), determines uniquely the scalar multiple denoted by 


Xa- For example, when h{y) = gUylllji see easily that Xa = 


V2E 






Since we assume G DC(n — 2, a, r), we have at most two resonances as a varies 
in an interval. We always have a hrst resonance given by the integer vector 

k' = (0,Qp, -qP,bn-3)/d, d := g.c.d.{pQ,Pq). 


The g.c.d. of all the components of k' is 1. Then we have 



■ 1 0 0 0„_3 


a 


a 

(2.3) 

n QP _qP n „ 

^ d d 


p 

Q 

P 

— 

0 

A. 


0 r S 0n_3 


Q 


qQ 


. 0)2 — 3 0)2—3 0)2—3 id)2—3 _ 


L W)2-3 j 


L ^n-3 J 


where r, s are such that sQp + rqP = d. We denote the n x n matrix by M', which is 
in SL{n,Z). 


2.1.2. Double resonance, away from triple or higher resonances. In this section, we 
consider that the vector (2.2) at double resonance. We fix some large number K. As 
a varies in an interval, we may encounter double resonant points 

{wq I (k, uja) = 0, for some k G Z”" \ span^ {k^} , and |k| < AT} . 


There are finitely many such double resonant points, whose number depends only on 

K. 


The next lemma shows that for hxed K, points along the frequency line Wa are 
uniformly bounded away from triple or higher resonances. 

Lemma 2.1. //k" £ \ spang {k'} is the second resonance of Xa(fi, , 

(k",wa) = 0, |k"|<i^, 

where £ DC(n — 2, a, t), then for all 

k = (fci, A: 2 , ..., kn) G Z” \ spang {k', k''} , and |k| < K, 
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we have the estimate 

(2.4) |(k,a;,)| > 


a ■ inf„ A„ 


2^iqQy+H\\M'\\ocK) 


2t+1 • 


Proof. We use the linear transformation (2.3) to convert uia to the vector 

Lj'^ = M'uia = Aa (a, 0, ^. 

Denote by k" = {k'{, k'f ,..., k'f) := k"M'“^ so that we have 
0 = (k",Wa) = {k"M'-\M'iOa) := 

We have that k'f ^ 0 since otherwise (k",t<;^) = 0 for all a, which is impossible 
considering that w* _3 is Diophantine. We want to bound | (k, Wq) | from below for all 

k = (fei, /c 2 , ■ • •, kn) G Z” \ spang jk', k"} , and |k| < K. 

We denote k = (fci, ^ 2 , • • •, kn) ■= kM'~^ to get 

(kjWa) = (kM'“\M'a;a) = ^k,a;(j^. 

We introduce a new vector k as follows 

k = k — ^k!' = —(k'fk — kik!'). 
h" h"^ ^ 

The new vector 

k'ik = k'fk - fcik" := ( 0 , h, h, fen- 3 ) G 

has zero first entry. We introduce further a new vector 

k= (o,dk2,dk3,qQkn-3] G Z"-. 


We estimate the norm of k as 

|k| < qQ\kfk - kik”\ < 2qQ\k”\ ■ |k| < 2gg(||M'||oo • K)^ 

Using the Diophantine conditions and the fact that has zero second entry, we have 


Kk,Wa)| = 




(2.5) 


k'lqQ 


> 


= Kk,-0l = 

k, ( 0 , 0 ,i,cu:_ 3 ) 
ainfa Aa 


1 


_((fc'/k-fcik"),o;' 

infa Aa a 


> 


k'lqQ 


2-{qQ)-+W\M'\\^K)‘^-+^' 


□ 


Finally, we have the following fact. 

Lemma 2.2. Suppose the g.c.d. of the entries ofk" is 1. Then there exists a matrix 
M" G 5'L(n, Z) whose first row is k" = k”M'~^ with the first entry k'f 7 ^ 0, and the 
second entry k'f = 0 , such that M"M'iOa = (0, 0, c 5 ((„_ 2 ), where the is a non 

resonant vector in Moreover, the matrix M" has a special form such that 

a(l, 2 )M"a(l, 2 ) = diag{l,M"_ 3 }, 

where is a matrix of order n — 1, its first row is the vector 7 r_ 2 k". 
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Proof. Denote by = M'uja- We have {\P',uJa) = {\<i"M'~^, 00 '^) = 0. We set the 
second entry of k" to be zero and treat it as a vector in It is a basic fact in 

number theory that we can hnd re — 2 integer vectors which span unit volume together 
with k". By adding a number 0 as their second entry, we extend these vectors to 
be re-dimensional and put these vectors together to get an re x re matrix M" whose 
hrst row is k" ;= k"(M')~^, the second row is (0, 1 , 0 ,..., 0 ), it satisfies the properties 
stated in the lemma. □ 


2.2. Resonant submanifolds and their neighborhoods. Let oIq, k', k" be as in 

Section 2.1 We define the single resonant sub-manifold associated to the vector k' 

(2.6) E' := {y \ (k',w(y)) = O) . 

The following resonant curve determined by the frequency line oJa lies in E' 

(2.7) r; = c E'. 


In the single resonant sub-manifold we define the double resonant sub-manifold for 
the resonant vectors k',k" 

(2.8) E" := {y \ (k',a;(y)) = (k",a;(y)) = O} , 


Next, we hnd a number y as the size of the neighborhood of the single resonant 
manifold such that within this neighborhood, either Proposition |2 . 1| or Proposition 
(in the next subsection) applies. We use the notation B{a, r) to denote a ball of radius 
r centered at a and the notation B{A, r) := \Ja&AB{a, r) to denote the r-neighborhood 
of a set A. 

We denote a'{, o!f^ the list of points such that the corresponding frequency 

vector lOq!' admits a second resonant vector k"„, z = l,2,...,rre. The total number of 

such a'^’s is bounded if we require |k"| <K for some given large number K. 

Lemma 2.3. Let oJa, K, k', k"„, i = 1,2,... ,m be as above. We define 

“i 

_ 1 a- infa \a 

^ ~ 2nK 2-(gQ)-+i(||M'||ooK)2-+i ‘ 

Let k"» be the (re — 1)-dimensional space orthogonal to the vector k",,, then 


(1) for all uj in the neighborhood 

B{ua,y)\[jB , 

i 

and for sufficiently small e we have 

|(k, w)|>e'^, Vk G Z” \ spangjk'}, |k| < iL. 

(2) for all UJ in B^uJa, p.)f]B (^uj^// + k((,;f, , for each i and for all 

k G Z" \ spang |k', k((//| , and |k| < K, 

we have 

I \ I ^ ^ 

- 2-+i(gQ)-+i(||M'||ooiL)2-+i' 


(2.9) 
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Proof. Part (1). We consider two cases depending on if k in the assumption is one of 
the double resonant vector k"„ or not. 

First we suppose k = k"„ for some i, then we get 

°-i 

I(k, w)! = I(k, Wa") + (k, W - UJa")\ = Kk, W - UJa")\- 


By the assumption, the projection of a; — uja" to the vector k^„ has length at least . 
This completes the proof in the case k = k"„ for some i since |k| > 1. 


Next, suppose k / k"„, Mi. Suppose there exists satisfying the assumption of 

part (1) but we have |(k, a;l)| < . We consider oja in the frequency line segment that 

is within /r distance of , i.e. — Wal < /r for some a. First we must have the first 
entry ki of k is non vanishing. Otherwise, we have that the vector kM'“^ has zero 
hrst entry and M'oja = (o, 0, has zero second entry. We have the estimate 


( 2 . 10 ) 


I (k, Ola) I = \{kM'-\M'0Oa}\ > 


a ■ infa Aq 


(qQy+H\\M'\UK)2r+i 


using the Diophantine property of We get 


( 2 . 11 ) 


|(k,a;'^)| > Kk,a;a)| - |(k,u;'^ - (jJa)\ 

a ■ infa Aa 

{qQY+W\M'\\^KY-+^ 

a ■ infa Aa 


> 


> 


— nKn 


2{qQY+Y\\M'\\ooKY^+^ 
which is a contradiction. Next, since ki Y 0) we change the first entry a of oia to d' := 
o — to get another frequency vector Wa". We have by definition (k, Wa") = 0. 

This contradicts to the assumption that k Y kY V i. 

Part (2). For given uj as assumed, we have |a;—Wa"! < As k G Z"'\spang{k', k"„} 
and |k| <111, we have the following estimate 

l(k,w)| = |(k,a;a») + {'k,u) -UaYl 
> |(k,a;a»)| - |(k,a; - uja")\ 
a ■ infa Aa 


> 


> 


2YqQY^W\M'\\ooKY^^^ 

a ■ infa Aa 


— reAT/i 


2-+l((?Q)"+l(||M'||ooi^)2-+l 

where in the second inequality, we apply Lemma |2.1| and in the third inequality, we 
apply the definition of /U. □ 


We choose a < such that oj in the neighborhood of item (1) of Lemma 


satisfy the assumption of Proposition 2.1 


2.3 


all 


2.3. Homogenization. We hrst introduce the C'’-norm as follows. 

Definition 2.2. For functions f{x,y) defined on a domain T> x we define the 


\f\r ■= sup 

yeV 


E E 

keZ" |a+/3|<r 


df^ 

dyOL 



norm as 
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where is the k-t/i Fourier coefficient and we use the multi-index notation = 
• • -x"", etc. for a = (01,02, • • • ,On), j 3 = (/ 3 i,/ 32 , ■■■ffin) e Z'", o*, A > 0 , i = 

1 .2.. .. ,n. 

2.3.1. Covering a ^-neighborhood Bioja, of the frequency line oJa- For the frequency 
line segment oia, consider any a" such that there is a second resonant vector k"„. The 
set dh~^ (yB{uja" + k"^,e'^)) is an 0 (e'^)-neighborhood of S" in the space of action 
variables. We choose some large number A and cover this 0(e‘^)-neighborhood of S" 
using balls of radii Ae'^ centered on S". 

In the complement of the above 0(e'^)-neighborhood of the double resonance sub¬ 
manifold S" inside B{uja, fr), we use a covering by balls of radii A^/e. Let y* to denote 
the center of any of the above balls. When we are considering a ball covering Ti", we 
require y* G T,". Whenever a ball covers the single resonance sub-manifold S', we 
require y* G S'. 


2.3.2. Homogenization. We introduce the homogenization operator 

(2.12) S 3 : y-y*:=y/eY, t = Tfy/£, H{x,y) = sh\{x,Y), 

where Y, r, H are the homogenized action variable, time and Hamiltonian respectively. 
The Hamiltonian becomes 

(2.13) H(x, T) = ^ Y) + J(Ay, T) + V(x) + P(x, Vi>^), 

£ ye z 

where 


( 1 ) 


Hy*) 


-\- ■^{u*,Y) -|- ^{AY,Y) is the first three terms of the Taylor expansion 


of h{y) around y*; 


(2) cu* = f (y*); 

(3) A = |p^(y*) is a positive definite constant matrix; 

(4) V(x) = P{x,y*) is the constant term in the Taylor expansion of P{x,y) with 
respect to the variable y; 

(5) The term P has a decomposition P = P/ -I- Pjj where 

1 


P/=- 


(2.14) 


£ 

6 


+ V^T) - h{y*) - ffi-£{uj, y) - I (Ay, y)), 


/o dyidyjdyk 
Ph =P{x,y* + ffi£Y)-P{x,y*) 


{ty -F (1 - t)y*)r dt 


=^/^(Y, 


dP 


The estimate of the remainders Pj and Pjj relies on the size of the balls. When a 
ball of radius is used to cover S", we have the following estimate. 

(2.15) ' ' 


dx’^dYh 


< \/3\ > 0, for |y| < 


where > 0 is an upper bound of C"’^^-norm of h and P, in terms of (x,y). P/ is 
independent of x and is large for large Y, however we have 

|9^P/j 


(2.16) 


dYh 


= 0{ffi£) for |y| < const, as e —^ 0 and \j3\ < r. 
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2.4. The normal form around single resonance. In this section, we work out it. 


Proposition 2.1. Given any small 5, suppose y* is such that lo* = dh{y*) G B{uja, y) 
and satisfies the following for some constant Ci > 0 , 


(2.17) 


|(k, w*)| > C'ie’’+2 5 k G Z"" \ spangjk'}, |k| < K = (<5/3) 2 , 


Denote the frequency map by u{y) = dh{y). Then there exists a symplectic transfor¬ 
mation (j) defined on {£~^/^B{y*, \/e)) x which is O£^.o(l) close to identity in the 
C^~'^ topology, |^ —id|r -2 = 62 ), sending the Hamiltonian H G 0 / (2.13) 

to the following form provided e is sufficiently small 

H o fiix, Y) =h{y*) + T) + J (AT, T) + P ((k', x)) 

(2.18) e 2 

+ 6 Ri{x) + e 2 (r+ 2 ) Rjj[x, Y), 

where V consists of all the Fourier modes in V with k G span^jk'}, the remainder 6 Rj 
consists of all the Fourier modes in V with |k| > K, and the norm ofV, R/, R// 
satisfies \V\r, |R/|r- 2 ) |R//|r -2 < 0 ( 1 ) as 5,e ^ 0 . 


Remark 2.1. Considering the part (1) of Lemma 2.3. the assumption (2.17) on the 
frequency u* is satisfied if y* is not the center of the balls covering -neighborhood of 
choice of the covering, such a hall centered at y* has radius ffi~e. 


Proof. We decompose the Hamiitonian (2.13) as foiiows 
H =-h( 2 /*) + -^{io*,Y) + J(Ay,y) + F((k',x)) + R<{x) + i?>(x) + P(x, Ve^). 

e y/£ 2 

where 14((k',x)) contains aii the Fourier modes ofV(x) in the spangjk'}, the remaining 
term R<{x) + R>{x) contains aii the Fourier modes of V(x) in Z”'\spanz {k'}. Given 
6 , iet K = The Fourier modes for k G Z*^ \ span^jk'} with |k| < K are put in 

R< and those with |k| > K are put in i?>. We have the estimate |i?>|r -2 < d. 

Oniy one step of KAM iteration is needed. We use a new Hamiitonian ^/£F whose 
induced time-1 map gives rise to a sympiectic transformation 

H ° i"} + I / V - t){{^,F}, P}(4>*^^) dt 

= ^ + n(k',x)) + (cu*, 1 ^) 

+ i?<(x) -I- R>{x) + P(x, ffieY) 

/ f)P f)J^\ p 

+ \/e(^Ay + —, —^ {l-t){{R\,F},F}{<^>^^p)dt, 

where F solves the cohomological equation 

dF' 


R<{x) -I- ( uj*, 


dx 


= 0 . 


Notice that F is a function of only x. Notice also \P\r = 1 and V(x) = P{x,y*), so 


we get |F|r = 0 {£ ^+ 2 ^ 2 ) as £,6 
the assumption (2.17). 


0 by solving the cohomological equation under 
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Let (5R/ = i?>. As we choose K = 6 and V G we have |R /|,._2 < 0(1). Let 
R„ = P(i. v^F) + ^/;(AF + 5^. 

+ 1 / (1-*){{H,F}.F} d«. 

We have 

( 1 ) |P|r+i < |P/|r-i + IP/zIr-i < 0^6^/^ from formula (2.15) and (2.16), 

(2) using the derivative estimates of F and the fact that Y = 0(1) we find 

(3) since {H, F} = |;^(a;*, Y) + ^(AL", F) + V(x) + P, f|, we find 

Al-t){{H,F},F}((/)*^^) dt =0(5i/V/2-^). 


r-2 


Therefore, we have |R/ 7 |j ,-2 < 0(5^0) and can make the term arbitrarily 

small by decreasing e. The proof is now complete. □ 


2.5. The normal form around double resonance. In this section, we obtain the 
normal form of the Hamiltonian system in a neighborhood of S". 

Proposition 2.2. For all small J > 0, there exists a symplectic transformation (p 
defined on £~^^‘^dh~^{B{ijj*,£^)) x T”, which is 0 (v^( 5 “ 0 +i)) close to identity in the 
topology, \p — \di\r -2 = O(-y/e(5“0+i))^ sending the Hamiltonian H gO^ of 
to the following form provided e is sufficiently small 

^2 Ro 4 >{x,Y) = h{y*) + + ^(Ay,y) + PKT) 

+ V ((k', x) , (k", x)) + SRf(x) + e'^R//(x, Y), 

where V consists of all the Fourier modes in V with k G span^jk', k"}, the remainder 
JR/ consists of all the Fourier modes in V with |k| > K, and the norm of 

V, R/, R// satisfies \ V\r, |R/|r- 2 , |R//|r -2 < 0(1) as e —)• 0 . 


(2.13) 


Proof. We decompose the Hamiltonian (2.13) as follows 

H =-/!(/) + ^(a;*,y) + ^(Ay,y) + y((k',x), (k",x)) 

£ \J£ A 

+ R<{x) + F>(x) + P(x, 'JlY). 


where 


(1) y((k',x), (k",x)) contains all the Fourier modes of V(x) in the span^jk', k"}; 

(2) the remaining term R<{x) + Ry{x) contains all the Fourier modes of V(x) in 

Z"\span 2 {k', k"}. Given 6, we choose K satisfying K = The Fourier 

modes for k G \ spang{k',k"} with |k| < K are put in i?< and those with 
|k| > iL are put in ii>. We have the estimate \R-y\(jr -2 < 5. 
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To run the KAM machine for one step, we use a new Hamiltonian ^/eF whose induced 
time-1 map (f’^p gives rise to a symplectic transformation 


H ° ^VeF =H + V~e{y^,F} + I ^V - F}, 


dt 


Y) + hAY, Y) + H((k', x), (k", x)) 
e 2 

+ yw*, T R<{x) + Ry{x) Y P(a:, \/eY) 
dP dF\ 

+ |^\l-t){{H,F},F}(<h^^)dt, 


where F solves the cohomological equation 


R<{x) + {uj*, — ) -0. 


Formula (2.9) in Lemma 2.3 is used to bound |(k,a;*)| from below for |k| < iL by 
for some constant C independent of 6 ,e- Notice that F is a function of 
only x. Notice \P\r = 1 and V(x) = P{x,y*), so we get |F|r = as e,5 —)■ 0 

by solving the cohomological equation. 

Let (5R/ = F>. As we choose K = 5“^/^ and V G (7^, we have |R/|r -2 < 0(1)- Let 

/ f)P r)J^\ 

e°'R// = Pii{x, V^Y) + ^/£[AY + ^) 


+ y\l-t)mF},F} dt. 


We have 


(1) we obtain from formula (2.15) that < Cr£^, 

(2) using the derivative estimates of F and the fact that Y = we find 

(3) since {H, F} = Y) + F) + V(x) + P, f|, we find 


Al-t){{H,F},F}(0O dt = O (e'^F:2(2-+i)V 

J 0 r—2 


Therefore, we have |R//|r _2 < and can make the term e^Ru arbitrarily 

small by decreasing e. The proof is now complete. □ 


3. The choice of diffusion path: from single, double to complete 

RESONANCES 

3.1. Normally hyperbolic invariant manifold for Hamiltonian system. In this 
section, we introduce the theory of normally hyperbolic invariant manifold (NHIM). 
Besides its persistence under perturbation, we shall use the fact: a Hamiltonian system 
restricted on a NHIM is still a Hamiltonian system with less degrees of freedom. It was 
discovered and used intensively by Delshams, de la Llave and Seara in their geometric 
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theory of Arnold diffusion of a priori unstable type to get a Hamiltonian restricted on 
a NHIC whose time-periodic map is the inner map (cf. |DLS08| 1. 

We introduce the definition of the normally hyperbolic invariant manifold following 
[DLS00[ [DLS08] . As remarked by de la Llave, the definition in |DLSnn[ IDLSOS] is less 
general than that of [E] but better adapted to the symplectic structure. 

Definition 3.1. Let f : M ^ M be a -diffeomorphism on a smooth manifold M 
with r > 1. Let H C M be a submanifold invariant under f, /(H) = H. We say that 
n is a normally hyperbolie invariant manifold if there exist a constant C > 0, rates 
0 < X < < 1 and a splitting for every x G H 

T^M = T^U 

in such a way that 




\Df{x)v\ 

< CX'^\v\, 

k >0, 

veEf 


\DfHx)v\ 

< CX^%\, 

k < 0, 

V G T^U 


\Df\x)v\ 

< C/|a|, 

k €Z. 


Theorem 3.1 (Theorem A. 14 of |DLS00| ). Let Hx C M - not necessarily compact - 
be hyperbolic for the map fx generated by the vector field X, which is uniformly C'' 
in a neighborhood U of ILx such that dist{M \ U,Ilx) > 0. Let fy be the C^-map 
generated by another vector field Y which is sufficiently close to X in the -topology. 
Then, we can find a manifold Hy which is hyperbolic for Y and close to lix in the 
fi-S} fQpgiggy^ j’/jg Lyupunov exponents for Hy are arbitrarily close to those 
of Ilx if Y is sufficiently close to X in the topology. The manifold Hy is the only 
manifold close to Hx in the topology, and invariant under the flow ofY. 


When the normally hyperbolic flow is Hamiltonian, we have the following theorem 
saying that the restriction of the Hamiltonian system to the central manifold is also 
Hamiltonian with less number of degrees of freedom. 

Theorem 3.2 (Theorem 23 and 26 of |DLSn8] l. Suppose M is endowed with a {an 
exact) symplectic form uj Let fs : M ^ M be a family of Hamiltomorphisms, r >2 
preserving u. Assume that H C M is a normally hyperbolic invariant manifold for /o 
with rate X,fj.. 


( 1 ) 

( 2 ) 


Then for sufficiently small e, there exist -families of diffeomorphisms ke, r^ 
with I < min |r, satisfying o feg = fcg o where is the map such 


that /c£(n) = He and rg : H —)• H is the restricted map on H . 

We denote by TZs the generating vector field corresponding to r^ defined by 
= TZe o re. Then we have 

• kfuj = can is a {an exact) symplectic form on H. It is independent of e. 

• The vector field TZe is {exactly) Hamiltonian vector field with respect to 
can. Moreover, its {global) Hamiltonian is Re = Fe o ke where Fe is the 
Hamiltonian for fe. 


3.2. Normally hyperbolic invariant cylinder (NHIC) in the regime of single 
resonances. Using Formula (2.3), we introduce a linear symplectic transformation 
denoted by 

x' = M'x, Y' = {M')-^Y. 

The frequency oJa is transformed to ca(j = M'uja = Aa(a, 0, ^,ca*_ 2 ). 
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In Proposition 2.1, we choose y* G S' such that w' = M'u* has zero as the second 


entry. Applying the symplectic transformation dJl' to the normal form (2.18), we get 
the following system 


911' 


/— 1 * 


H o ct> =-h(/) + Y') + UaY', Y') + V (x' 2 ) 

£ ye z 


+ 6 Rj{x) + e^P+s) Rjj{x, Y), 

where A = M'AM'* and Ri(x') = Ri(M'-^x), Rjj(x',Y') = Wl'-^*Rn(x,Y). 

We denote by Z'{x',y') = Z{{k', x), y) where Z{{'k',x),y) consists of all the 

Fourier modes of P{x, y) in the span^jk'}. From the definition of V ((k', x)) in Propo- 
we get that P((k',x)) = Z{{k',x),y*). 


sition 


2.1 


The following assumption is shown to be generic 1 |CZ1| and Theorem B.l of |C12) 1: 

(HI): For each y e F(j, the function Z' is non-degenerate at its globally maximal 
point, i.e. —d‘^iZ'{x 2 ,y) > 0 holds if x'f is a globally maximal point. The number of 
global maximum is unique except for finitely many y’s there are two global maximums. 

Under this genericity hypothesis, we get that the function V{x' 2 ) has non degenerate 
global maximum point for all y* in a //-neighborhood of F(j and 5 small enough, since 
h = 05-s.o(l) • 


In (2.18), we neglect the remainder ( 5 i? 7 --|-e 2 (r-+ 2 ) Y) to get that the remaining 


system 


H' := h{y*) + + ^iAY',r) + V (x') 


admits a NHIC given by 


n 


0 

n —1 




X 2 = 


ld{AY',Y') 


n 


dYf 


= ^A2^Yf = 0 


(3.1) 

"Vn _ -- 

V ^ ^ 1=1 

The normal Lyapunov exponent does not depend on e or 5. Restricted to the NHIC 


we get a system with one less degrees of freedom due to Theorem 3.2 We denote by 


Y = 7r_2T', 


X = TT-2X 


OJ* = 71 - 20 ;'*, 


and by {AY, Y) the quadratic form {AY', Y') obtained by substituting 1^2 solved from 
(3.1). The restricted system now has the form 


^o = ^^{ui\Y) + \{AY,Y) 

where we have dropped some constants for simplification. Also, it can be verified 


directly with the help of (3.1) that Hq gives rise to the Hamiltonian flow restricted to 
the NHIC. 

The system H (without the linear transformation) is defined in a A ball in the Y 
variables since the homogenization is done in a Ay^ ball. To the locally finite covering 
that we use before homogenization, we associate a partition of unity '^ifiiY) = 1 to 
the homogenized system, where ifiiY) G C°°, ipi{Y) = 1 if |T| < (1 — d)A and is zero 
for |y| > (1 — 0.5(i)A where 2 d is smaller than the Lebesgue number of the covering. 


To apply the NHIM theorem, we replace the remainder 6 Rj Rjj in (2.18) by 

f;i{M"Y){6Ri + Rjj). 


We next apply the NHIM theorems |3.1| and |3.2[ In the Hamiltonian equations, the 
vector field in the center is fast x = ^ 0(1). However, the NHIM theorems are 
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still applicable since the large term ^ is constant, which does not contribute to the 
derivatives of the Hamiltonian flow, since the derivatives of the Hamiltonian flow are 
solutions of the variational equations where ^ does not appear. The time-one map of 


the Hamiltonian flow admits NHIM given by (3.1) satisfying Definition 3.1 where none 
of the constants A, /r, C depends on s. The perturbation to the Hamiltonian is C^~‘^ 
small hence is a small perturbation to the vector field. Only the smallness of 
the perturbation to the vector fields, the normal hyperbolicity and the derivatives of 
the time-T map for some T (can be chosen to be 1 in our case) are needed in the proof 
of the NHIM theorem |Fe| . Applying the NHIM theorems we get a NHIC which is 5 
close to the unperturbed one in the (7^“ ^-topology as the center Lyapunov exponents 
are zero. The NHIC for the modified system agrees with that for the system 2.18 


m 


the region where ijji = 1 and in the overlapping region of two balls in the covering 


the NHICs coincide due to the local uniqueness. In this case, we apply Theorem 3.2 
to restrict the system to the NHIC to get a Hamiltonian system with one degree of 
freedom less 


(3.2) 


H5 = ^{oj\Y) + 1{AY,Y) + 6Ri{x) + Rnix,Y), 

ye z 


where \Ri — Ri\^Yi],r -2 = 05 - 5 ’o(< 5 ^)) i To obtain this form of the Hamiltonian, 

we write the perturbed NHIC as a graph over the unperturbed cylinder 

Y^ = Y^ + 6 Gy{Y,x), x '2 = x' 2 * + 6 G^{Y, x) 

for some Gx,Gy £ the 5 = 0 case corresponding to the unperturbed NHIC. 

Substituting the graph Y^^x'^ to the system 911'“^*H o (f){x,Y) above, we get a 5^ 
correction to the restriction of Ri, i = I,II on the NHIC. 

3.3. Strong double resonances and shear transformation. The number of dou¬ 


ble resonance treated in Proposition 2.2 depends on 6. However, most of the double 


resonances are weak and can be treated as single resonance. The number of strong 
double resonances is independent of 5,e. 


3.3.1. Criteria of distinguishing weak and strong double resonance and the reduction of 
order at weak double resonance. Consider a double resonance associated to the vector 
k". We decompose Z in (2.19) in Proposition 2.2 as 

(3.3) Z((k',x),(k",x)) = Z'((k',x)) + Z"((k',x),(k",x)) 

where Z' includes all the Fourier harmonics in the span{k'} and Z" contains the rest. 

Notice Z" must contain at least one term with k". Since k' does not depend on 5, we 
get \Z"\(jt -2 < for some constant C independent of 5. We first treat Z" + 6R as a 

perturbation to the truncated Hamiltonian , Y) + 5 (AT, Y) + Z'{{k!,x)), which 

has a NHIC following from exactly the same reasoning as the previous section. There is 
a threshold denoted by 5 that is the maximal allowable C^ norm of the perturbation 


for applying the NHIM Theorem 3.1 to the NHIC in the truncated Hamiltonian. 
The threshold <5 does not depend on 5, e so we get when 5 > 2 , we treat the 

corresponding double resonance point as a single resonance, otherwise we treat the 
point as a strong double resonance point. The total number of strong double resonance 
points are bounded by (^)”^^ which is independent of e, 5 for given generic P. 

After the reduction of order and also undoing the homogenization, we obtain a 


Hamiltonian system of the same form as (3.2) with 6 replaced by <5 
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3.3.2. The shear transformation for the strong double resonances. Next, we write the 
homogenized Hamiltonian in the neighborhood of the double resonant sub¬ 

manifold S" into a standard form. We denote by a" the a for which there is a second 
resonance vector k". 


In the next lemma, we are going to introduce a linear symplectic transformation that 
is in SL{2n,'R) but not in SL(2n,Z). The problem with this transformation is that it 
does not induce a toral automorphism on hence not a symplectic transformation 
on T*T^. To hx this issue, we lift r*T” to its universal cover and interpret this 
symplectic transformation as that on We will introduce an undoing-the-shear 

transformation later, so that the composition of the transformations is on T*T^. 


Lemma 3.1. There is a linear symplectic transformation denoted by such 

that the Hamiltonian system Hocj) in Proposition \2.^ is reduced to the following 
Hamiltonian defined on up to an additive constant 


(3.4) 


=G(x,y) -h G(y„_ 2 ) Pi{y) + 5Ri(x) + s'"Rii{x,y), 


where 


(3.5) 


G(x,y) =^^iy,y^ + ^(x) 

G(yn-2) =2 {yn-2, Byn-2) + —^{djS,n-2,yn-2 


where ojs = (6, 0 )^- 2 ) = SM"M'lWa" = M"M'coa", for S in (3.12). The two matri¬ 
ces A and B = {A — A^A~^A) are positive where A, A, A in's? , K 2 x(n- 2 )^ ■^{n- 2 fi 
respectively form the matrix 


(3.6) 


A = 


A 

A^ 


The remainders Rfiy) = © 

l^/(x)lc’-2) I-R//(x)Ic’-2 = 


—1 * gjj //—1 * —1 * 


501' i = I, II, P/, R/, R /7 are in (2.19), with 


0 (1) as £,6 ^ 0. 


Proof. We use the linear symplectic transformation denoted by 501"901', 

(3.7) x" = M"M'x, Y" = 

We also keep track of the frequency vector uj'f = M"M'oJa = M"uj'a = (u(a), 0, *,..., *) 
where ^{a) depends on a linearly and u(a") = 0. 


We get a Hamiltonian system 

Hf :=(9Jl"-^)*(9Jl'-^)*Ho(/. 


(3.8) 


=^(a;"*,y") + 2(7l"y",y") + Vixfix'') + P';(y") 
+ hiY*) + dR'Iix'') + e^^R'iiixfi Y") 


by applying 9J1"901' term by term to (2.19). In the following discussion, for the sim¬ 
plicity of notations and without leading to confusion, we also remove the ". Let us 
denote 


(3.9) 


G(y,x) = ^{uj{y*),Y) + ]^{AY,Y) + V{xi,X2), 
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where the first two entries of uj{y*) are equal to zero since we have fixed the base point 
of homogenization Y* G Y" before Proposition 2.2 In this system, the variables Yn -2 


are cyclic so we can treat this system as a product of an integrable system with a 
system of two degrees of freedom. Moreover, we can separate variables by introducing 
a shear transformation. 


We write the matrix A in block form of (3.6). We also denote v = (^ 1 ,^ 2 ) as the 


first two entries of a vector v G M”". Next we have the following formal derivations 
G{Y,x) + 1{AY,Y) + V{x) 

= ^{AY,Y) + {Y,AYn- 2 ) + V{x) 

+ 7:{AYn-2,Yn-2) H- 1= {<^n-2iYn-2) 

I ye 


(3.10) 


= ]-{A{Y + A-^AYn- 2 ), {Y + A-^AYn- 2 )) + V{x) 


1 


1 


1 


~ 2{AYn-2, A AYn-2) + ~;j^{AYn-2,Yn-2) + (w„_2, W,-2 

= \{A{Y + A-^AYn-2), {Y + A-^AYn-2)) + V^x) 

Y —{Yn-2, (A — A^ A ^ A)Yn-2) Y £ {Cjn-2,Yn-2) ■ 

We perform the following linear shear symplectic transformation denoted by ©, 


(3.11) 


y 


id2 

A-^a 

Y 

yn-2 


0 

idn-2_ 

Yn-2_ 


X 


5^ 

to 

O 

_1 


X 

_Xn-2 


id„_2. 


in—2 


so that the homogenized system in the new coordinates is written in the form G = G+G 
stated in the lemma. We denote 


(3.12) 


5 = 


id 2 0 

idn-2 


s-^ = 


id 2 A ^A 

0 idn-2 


so that the above symplectic transformation & simplifies to x = Sx, y = S ^Y. 

Since A is positive definite and the linear symplectic transformation S does not 
change the signature so we get both A and B = {A—A^ A~^ A) are positive definite. □ 

Remark 3.1. Notice the above matrix S is identity in the x component, hence the 
Hamiltonian G depends on x periodically. We ean project the domain of G from to 


r*T^. So in the following, we think G as a Hamiltonian defined on T*T^ 


3.4. NHIC in Hamiltonian systems of two degrees of freedom. The system 
(3.4) is defined in a 0(e'^“^/^)-neighborhood of the sub-manifold of double resonance. 


We pick a large number A independent of e and consider two different regimes in the 
space of action variables: the low energy region and the high energy region. The low 
energy region is defined to be a Ae^/^-neighborhood of the submanifold Y" ( 2 . 8 ) in 


the original scale (a A-neighborhood in the homogenized system). The complement, 
i.e. the part of action space that is Ae^/^ away and within distance from Y”, is 
called the high energy region. 
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In the low energy region, we consider a covering in the following way. In the system 
H 5 of (3.4), we cover the low energy region by balls of radius 2A centered at some 
point y* G S". we perform the Taylor expansion around y*. Formally we get the same 
expression as before with slight modification of the values of A and uJn -2 by 0(8^^), 
however, the new Pi(y) term, i.e. the third order and higher terms in y — y*, now has 
estimate |iA|c’’ < since |y — y*| < A and there is a -y/e factor in Pj. So we absorb 
this Pi{y) term into the Rn terms in (2.19). Later we will allow A to depend on 
5, in which case, the estimate \Pi\c'^ < y/e still holds as e —>■ 0. So in the following 
discussion for the low energy region, we forget about the Pj term. 

We cite the following theorem from |C12j (see also [C15b| i 


Theorem 3.3 (Theorem 3.1 of |C12j i. Consider a Tonelli Hamiltonian H defined 
on T*T^. Given a class cq G FI^(T”,M), if the minimal measure is supported on a 
hyperbolic fixed point, then there exists an n-dimensional flat Fq C iL^(T”',]R) such 
that this point supports a c-minimal measure for all c G Fq. 



Figure 3. Two ways that the flat Fq connects to the channels 

In the following, we specialize to the case of n = 2. Each critical point x of the 
function V corresponds to a fixed point (x, 0) of the Hamiltonian flow <I>|. Especially, 
each non-degenerate maximal point determines a hyperbolic fixed point. Let J denote 
the standard symplectic matrix, it is obviously a generic property that 

(H2.1): The function V attains its maximum maxE = 0 at one point x = 0 only, 
where the Hessian matrix of V is negative definite. All eigenvalues of the matrix 
Jdiag(A, D^E) are different: —A 2 < —Ai < 0 < Ai < A 2 . 

Under this condition, for fixed yn- 2 ) the system G is a Hamiltonian system of 
two degrees of freedom. The a-function of G has a 2-dimensional flat Fq as the set 
of its minimal points. For each class c G intFo, the minimal measure is uniquely 
supported on the fixed point, which has at least three minimal homoclinic orbits with 
different homological class. These homoclinic orbits lie on the intersection of the stable 
and unstable manifold of the fixed point. As the Hamiltonian G is autonomous, the 
intersection is not transversal. However, for convenience, we still call the intersection 
transversal if the tangent space of the stable manifold and that of the unstable manifold 
span the tangent space of the energy level at the intersection point. 

We denote by A^ = {Axi,Ayi) the eigenvector corresponding to the eigenvalue Aj, 
where Axi and Ayi are for the x- and y-coordinate respectively, then the eigenvector 
for —Aj will be A“ = (A^,*, —Ayi). It is also a generic condition that 
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(H2.2): The stable manifold of the fixed point (0,0) interseets the unstable manifold 
transversally along each minimal homoelinie orbit. Each minimal homoclinic orbit 
approaches to the fixed point along the direction Ai; 'y{t)/\\'y{t)\\ —>■ A^;! as t ^ ± 00 . 

Under these two conditions (H2.1, H2.2), certain uniform hyperbolicity on mini¬ 
mal periodic orbits is proved in Lemma 4.1 (for energy close to zero) and Appendix 
B of |C12j and |CZ1[ (for all energy levels). 

Theorem 3.4 (Lemma 4.1 and Theorem B.2 of [C12j ). Consider a C^{r > 5) generic 
Tonelli Hamiltonian H with two degrees of freedom. Suppose on the zero energy level 
set, there is a hyperbolic fixed point supporting an action minimizing invariant measure 
and assume (H2.1,H2.2) for the fixed point. Given a class g G Lfi(T^,Z) and any 
V* > 0, there are finitely many Vi G (0, v*] only sueh that for u = Ui there exist exactly 
two vg-minimal periodic orbits, for other v G (0, iz*] there is only one izg-minimal 
periodic orbit. These periodic orbits are simultaneously hyperbolic, they make up one 
or more pieces of normally hyperbolic invariant cylinder. 


In particular, we obtain the generic property that, for the class g = (1,0), all ug- 
minimal periodic orbits are hyperbolic simultaneously for all v G (0 < 1^0 < 

vi < -|-oo). This Hamiltonian G as well as the Lagrangian L (obtained from G by the 
Legendre transformation) has been extensively studied in our previous work |C12] . Let 
us briefly review some results in |C12| if the potential V satisfies generic conditions 
(H2.1, H2.2). 

(1) each first homology class g G Lfi(T^,Z) determines a channel in iL^(T^,M), 
C{g) = .^js^izg) where is the Legendre-Fenchel transform, connected to the 
flat Fq such that for each c G C{g) the Mather set A4(c) consists of periodic 
orbits {7,7} with [7] = g. This channel admits a foliation of lines C{g) = UIe 
with Ie C aG^(£') such that M.{c) = Ai(c') if c,c' G Ie for some E G [ 0 , 00 ). 
For any E* > 0, there are finitely many Ei G {0,E*) such that the Mather 
set consists of two periodic orbits if the first cohomology class c £ lEi and 
the Mather set contains exactly one periodic orbit ( 7 e, 7 e) if the cohomology 
class does not he on these lines {lEi}- All these periodic orbits are hyperbolic; 
(See (H4) and Appendix B of |C12] .l 

( 2 ) there are two ways that the channel connected to the flat, either at a point c{g) 

or along an edge E(g'). In former case, the Mane set J\f{c{g)) consists of the 
fixed point and two homoclinic orbits (7i,7i) and the periodic orbit 

( 7 E) 7 e) approaches these two homoclinic orbits as E ^ 0. In this case, two 
integers ki,kj G N\{0} exists such that g = ki[^i] -|- kj[^j]. In latter case, the 
periodic orbit ( 7 e, 7 e) approaches a homoclinic orbit or the period remains 
bounded as i? —)• 0. (See Section 3.3 of |C12| .l 



3.5. NHIC in the intermediate region between single and strong double 
resonance. In this section, we deal with the high energy region. We need to take the 
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term P/ in (2.19) into consideration. Consider the part in (|3.4|) 
(3.13) 


G + P/(y) = ^{Ay,y) + V{x) + Pi{y). 


3.4 


to G with the frequency line ^ = 


dy 


We fix yn -2 as parameters and apply Theorem 
u(l,0), |u| G for some large d > 0 independent of e. Namely y £ := 

(^(G + P/))“^(u(l, 0)). We cover the curve using balls of radius C centered on Ty 
for some large constant C independent of e. We denote by y* £ any one of such 
centers and consider the Taylor expansion of G in a neighborhood of y*. Denoting 
y = y — y*, x = x, we get a mechanical Hamiltonian of two degrees of freedom up to 
a term depending on yn -2 only, 


G{x,y) = + ^{A'y,y) + V{x) + Pj{y,yn- 2 ), 


{x,y) G X 


where wi G [0, and Pj is the third and higher order terms of the Taylor expansion 
of Pj in y. By the choice of y* we have |y* — y*| < e ^~2 . Clearly, we have 


A' = A + 


d‘^Pi 

1)^ 


(y*) ■■= 


4' 4' 

/ill 


4' 4' 

/il2 /is 


12 
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transformation in Lemma 

1 


Because of the special form of Pj (see (2.14) from which Pj is obtained by a linear 

d^Pi, 


3.1), each entry of ^^(y*) is bounded by 0{e^) provided 


|y* — y*| < £^~2 . Morover, since |yn- 2 | can be as large as ^ also cover the 

n — 2 dimensional space of yn -2 using balls of radii C. Restricted in each such balls, 
we assume |yn- 2 | < C* in P'j, hence = 0{y/~£) as e —>■ 0. 


We first apply Theorem 3.4 to get a normally hyperbolic invariant cylinder. More 
details are given in |C15aj . 

We need to make sure such hyperbolicity is uniform for all e > 0 so that we can 

add perturbations by applying the NHIM theorem. The content of the next lemma is 

2 2 

explained in the example H{x,y) = ^ ^ + (cosx 2 — 1), for which the cylinder with 

homology class (1,0) is given by {{xi,yi,X 2 = 0,y2 = 0)} whose normal Lyapunov 
exponents are ±1 regardless how high the energy level is. The key idea in the proof 
of the next lemma is that for high energy level the xi variable in G is fast rotating 
hence is averaged out and yi becomes nearly constant. The remaining system for 
X 2 , ^2 is a system of one degree of freedom which generically admits a hyperbolic fixed 
point, whose hyperbolicity is independent of e. The normal hyperbolicity is given by 
the hyperbolic fixed point of the subsystem 2 ^ 222/2 + V(xi,X 2 ) dxi. 


Lemma 3.2. Consider (3.13). For all parameters yn -2 with |yn- 2 | < , for 


gen eric V, the Lyapunov exponents in the normal direction of NHIC given by Theorem 
3.4 is bounded uniformly away from zero for all \v\ £ for any given large 

constant d and for all £ sufficiently small. For large enough D, the NHIC is a whole 
piece without bifurcation. 


Proof. As usual, let be the Hamiltonian flow produced by G and let = $L |j=i 


G 


'G' 


What we need to show the cylinder H that we obtained by applying Theorem 3.4 
satisfies Definition 3.1 Since the cylinder is foliated by periodic orbits, the center 


Lyapunov exponent is zero. We only need to show that the normal Lyapunov exponent 
is bounded away from zero, independent of e. 
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Let 0 = -\/e we consider the case when ri ;:!> 1. In this case, the energy of 
the Hamiltonian G is of order 0(Q) if ||y|| < C, where C = 0(1) is independent of e. 
Under the symplectic coordinate transformation 

(3.14) {xi,X2,yi,y2) ^ X2, ^ 2 ), 

the Hamiltonian G turns out to be 

G = yi + ^^^112/1 + ^^i 2 yiy 2 + 2^222/2 + ^ X 2 ^ + Pj - 

Before proceeding, we explain the meaning of the rescaling in the xi, yi components. 
Our goal is to show the normal hyperbolicity of the time-1 map of the original Hamil¬ 
tonian system G. Performing the energetic reduction, we want to treat yi as the new 
Hamiltonian and xi as the new time. However, since |xi| is large in the high energy 
region, we introduce a rescaling such that the rescaled xi, considered as the new time 
has the same time scale as original time. The rescaling is not essential here. 

The equation G(xi,yi(xi,X 2 , 2 / 2 ), 2 / 2 ) = EVL is solved by the function 
yi = EVL- ^yl - A[^Ey 2 -V + 

Let r = —xi as the new “time”. Note ^ = 1 -|- 0(11”^), the normal hyperbolicity 
of at the energy level set {G = TTl} is equivalent to the normal hyperbolicity of 
^^jT=o( 27 r)- The Hamiltonian yi produces a Lagrangian up to an additive constant 

Ti = ^(^2)2 - ^X2 -v + L!- 1 Lr, 

^^22 ^22 

where Ln is C^^^-bounded in {y 2 ,x,X 2 ) for all H > 0. The minimal periodic orbit of 
the type-(i^, 0) for is converted to be minimal periodic orbit of . As it was shown 
in mu, the hyperbolicity of such minimal periodic orbit is uniquely determined by 
the nondegeneracy of the minimal point of following function 

I'Qj It: 

E{x 2 ,Pl,E)= inf f Li(j{T),'y{T),QT,E)dT. 

7(0)=7([G]f )=X 2 Jo ^ ^ 

A' E 

As we consider periodic orbits only, the term jf ±2 does not contribute to F, so we 

^22 

drop this term. Let 7 n,E(E, X 2 ) be the minimizer of F{x 2 , E), i.e., the action along 

this curve is equal to F{x 2 ,^, E), | 7 r 2 ,£;(E, X 2 )| is uniformly bounded for 0, G [flo,oo). 
As the system has one degree of freedom, ll“^-periodical in r, the minimal point of 
E determines an n“^-periodic curve 7^^;. We shall see later that | 7 q£;(t)| —>■ 0 as 
n —>• 00 . 

Although the Lagrangian Li depends on 11 in a singular way as H —?• 00 , the function 
F appears regular in H as H —)• 0. To see it, let us expand V into Fourier series 

u(llr,X2) =-Uo(x2)+ Vk{x2)P^^P 

kez\{o} 
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By the method to prove Riemann-Lebesgue’s lemma and using the boundary condition 
7n,F;(0,X2) = we find that for A: / 0 




Vfc( 7 n,E(r, X 2 ))e'‘'^ dr 


- / 

ikO, Jq 




<- 


ikVl 

1 


Vk{'yn,E{T, X 2 ))'yn, e{t, X 2 )e'''^ dr 

. C 

{VklljuMdr < 


' |A:|n Jq ' \k\^^} 

where the last inequality is obtained by assuming V € . So, for r > 2 we find 


F{x 2 , E) = Fq{x 2 , VL,E) + Q, ^Fr{x 2 , E), 


where 

/I 2 \ 

^0 = (^^^(7o,i?('r,X2))^ + Ro(7n,£;(r,X2))jdr. 

By perturbing the potential R —)• R + Vs(x 2 ) we can assume generically that > 0 
holds at the minimal point of Rq. Consequently, we have | 7 ^ £;(f)| —)■ 0 as n —>• oo. 

We claim that some /i > 0 and fio > 0 exist such that 5^^ ^ ^ holds at the minimal 
point of F{-, n, E) for all n G [rioj oo). As a matter of fact, the minimizer 7 o,_e(f, X 2 ) 
is uniquely determined by X 2 provided X 2 stays in certain small neighborhood of the 
minimal point of F{-,Q,E) (see Section 2 of [CZlj l. It follows that the initial velocity 
7n,F;(0,X2) smoothly depends on X 2 - Direct calculation shows that 


d^Fo 

dxl 


+ 




Jo 


1 ^ 


A'22 V dx2 




1 


d^in,E , 

-1Q.,E r. 2 -^ 

0 X 2 


/ ^7o,g 
v dx2 

5^7!: 


dr 


a . * .0 'yn,E\, 


where the second integral approaches zero as D —>■ 00 . Indeed, we have |7j2 £;('^)l 
as D —7- 0 and | 7 q £;(t) — X 2 I —)• 0 where X 2 is the minimal point of Vq. It follows that 
^( 7(1 s(f)) —>■ 0 as D —)• 0. To estimate the first integral, we note that the minimizer 
7 ^g(r) stays in small neighborhood of the minimal point of Vq, certain d > 0 exists 


(independent of D) such that ■^Vo{'yQ e) — ^ holds for all r G [0, [D]^]. As is 

the solution of the variational equation 


d-yn,i 


1 / ^7n,E \ _ d'^v d'jn^E , o _ q 

2^22 dr^ V dx 2 / dx 2 8 x 2 


with the boundary condition ^gy^(O) = ([^] ^) = !> where 


R — Rq 


d'yn,E 

dx 2 


+ i?i 


d 070,E 
dr 8 x 2 


+ e^i?2 


d?_ 

dr'^ 


8^n,E X 

8x2 / 


\Ri\co is uniformly bounded for D > 0, certain T > 0 exists (independent of D) such 
that > \ for all r G [0, T] U [[D]^ — T, [D]^]. These arguments lead to the 

conclusion that certain ^ > 0 exists, independent of D such that 

^i"o(7o,E(0),D,i?)>2/i 

holds for all large D ^ 1. As the function of action T is a 0(D“^)-perturbation of Fq, 
for all large D ^ 1, 8 ^^F > fj. holds at the minimal point of F(-,Q,E). 
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Given a large n, let X 2 be the minimal point of F and drop the notation of n. One 
has 

F{x2,E)-F{xl,E)>^x{x2-xl)\ 

if \x 2 — 3 ^ 2 ! suitably small. Let Be '■= u~ — denote the barrier function where 
are the backward and forward weak KAM solutions (see the appendix A), as it was 
shown in [CZlj . one has 

Be{x2) - Be{x*2) > F{x2, E) - E{xl, E). 


As barrier function is semi-concave, there exists a number Cl > 2 ^ such that 


BE{X2)-BE{XI)<CL{X2-Xlf. 


It guarantees that the hyperbolicity of the minimizer is not weaker than A = y 1 ~ ^ ■ 

Let us assume the contrary, denote by ( 7 E('r), 7 ^(r)) the minimal periodic orbit and 
denote by ( 7 ^(t), 7 ^(r)) the orbit such that 7 “( 0 ) = 7 ^( 0 ) and they asymptotically 
approaches to the orbit ( 7 *('r), 7 *(r)) as r —)• ± 00 , we then have 

l7E(±j) -7'^(±j)l > MlEiHj - - l))l 

if |7|;(0)—7^(0)| is suitably small. The following computation leads to a contradiction: 
Cl( 7^(0) - 7 e( 0))2 > BEil^m - BEilW)) 


> 


00 


j=i 


+ 


-{j)- 


> 2 fj. 


(7^(0)-7e(0))^ 

1-A2 


= Ci(7^(0)-7^(0))' 


Since E is bounded uniformly for > 0 and appears in the remainder Q such 

normal hyperbolicity holds for all E we are concerned about. 


Notice that in the above argument, the normal hyperbolicity for high enough en¬ 
ergy level is completely determined by the hyperbolic fixed point of the subsystem 
H 2 {x 2 -,V 2 ) = - Vo{x 2 ) where Vo{x 2 ) = - fp V(xi,X 2 )dxi and IA 22 - ^' 32 ! = 

0(6°^) for all |yn- 2 | < ■ We get the uniform normal hyperbolicity for all |yn- 2 | < 

£^^-1/2. For generic V, the function Vq has unique non degenerate global min, in which 
case the NHIC stays in a neighborhood of the set {{xi,y 2 , X 2 = argminVb,y 2 = 0)} 
for high enough energy levels, hence the NHIC has no bifurcations. □ 


Finally, we formulate the following lemma summarizing the way we apply the NHIM 
Theorem IQ and 13.21 


Lemma 3.3. Suppose the Hamiltonian system H satisfies (Hl.l, HI.2, H2) along 
the frequency line segment uja = Aq ^a, 2, . Then there exists A(k' ) > 0 

depending only on k' but independent of 6,£, being the maximal allowable norm of 
the perturbation, such that for each 6 < A(k'), where 6 is in Proposition 


2.1 


and 


2.2 


Theorem 3.1 and\3.2\ are applicable to the truncated system in a p-neighborhood of the 


frequency line coa to get the existence of NHIC in the full system in all the single, weak 
double resonant, and the strong double resonant case. 


Proof. The existence of such A(k') in the single resonance and weak double resonance 
case is given by (Hl.l). In the double resonance case, there are sub cases. In the low 
energy region for E < A for some large A, the uniform normal hyperbolicity follows 
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from Theorem 3.4 immediately. In the high energy region E > A the uniform normal 
hyperbolicity is given by Lemma 3.2 □ 


3.6. Center straightening and reduction of Hamiltonian systems with two 
degrees of freedom. We consider a Tonelli Hamiltonian system of two degrees of 
freedom H{x,y), {x,y) G t*T‘^ and assume (0,0,0,0) is a hyperbolic fixed point on 
the zero energy level supporting the Mather set of rotation vector 0. We fix a class 
g G i/i(T^,Z) then consider := ug {v ^ 0). We get that the Mather set is 
supported on a periodic orbits except that for finitely many u’s, the corresponding 
Mather set is supported on two hyperbolic periodic orbits. We get a cylinder foliated 
by hyperbolic periodic orbits ( 7 ,^, 7 ^/) as the Mather set of Aig^- Let Hq 
( recall is the Legendre-Fenchel transformation TM -G- T*M). Since each periodic 
orbit is hyperbolic, we get Hq is a normally hyperbolic invariant cylinder of dimension 
two. 


The normal hyperbolicity gives rise to the following decomposition of the symplectic 
form (Equation (63) of [DLSOSj i 




0 


0 

(3.15) 

n = 


0 

0 



0 

0 

n 

1 _ 


with respect to the decomposition of the tangent space 

T^M = © £■“ © X G Hq. 

Especially, the symplectic form H restricted to the cylinder is still a symplectic form. 
Einally, according to Weinstein’s neighborhood theorem (Theorem 3.30 of |MSj ). we 
can always find symplectic coordinates (I, ip, u, v) in a neighbourhood of the periodic 
orbits, such that (/, (p, 0,0) corresponds to the NHIC. A neighborhood of the NHIM 
is realized as a normal bundle. The symplectic coordinates (I, ip) in the NHIM is 
given by the next proposition. After that we consider the fiber coordinates. Notice 
dimE“ =dimE'^ = 1, we can write = du A dv where u is the unit covector of 
and V is the unit covector of E^. We choose u, v as the fiber coordinates. 

The next proposition holds true for all homology classes g G LIi(T^,Z). However, 
when applying the proposition in this paper, we choose exclusively g = ( 1 , 0 ). 

Proposition 3.1. For a Hamiltonian H G C'’'(r*T^,M) and a homological class g G 
we assume that 


(1) H{0,0) = 0. The Hamiltonian flow takes {x,y) = (0,0) as its hyperbolic 
fixed point, the point AfH{0,0) supports c-minimal measure for all c G Fq; 

(2) Some Ei > 0 exists such that for each c G Ie G1 C{g) with E G (0, Ei) the 
Mather set AA{c) eonsists of a unique periodic orbit {ae-iIe) 'with [ 7 ^;] = g. 

Let Hq = U {(7_E)7f;) ^ E G (0,Ei)}, then restricted on the cylinder Hq, there 
exists a symplectie ehange of variables $ : (I, y?) G M x T —)> (x,y)|no; sueh that the 
Hamiltonian H can be written as 


^*H = Ho^ = h{I), 
where h G has the following properties when / > 0 

m = o, = §>o. 


d^h{I) 

dl^ 


> 0 . 
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Assume furthermore that H is reversible, i.e. H{x,y) = H{x, —y), then h{I) is also 
reversible, h{I) = h{—I). 


Proof. Clearly, Ho is diffeomorphic to a piece of cylinder, which is a symplectic sub¬ 
manifold as one has 

r fEi t‘T(E) 

/ n = / / dE A dt > 0 

Juo Jo Jo 

where T{E) is the period of the periodic orbit lying on the energy level set H~^{E). 
Let ririo be the restriction of standard symplectic form on the cylinder. Denoted by 
n = T X M the standard cylinder and let II —?• IIo be a diffeomorphic map. Then 
the pull back of Dno, TgDno is a symplectic form on the standard cylinder Ho. As 
the second de Rham cohomology group of cylinder II is trivial, Moser’s argument on 
the isotopy of symplectic forms shows that certain diffeomorphism T: II —?• II exists 
such that 

= dxA dy. 

The Hamiltonian H induces a Hamiltonian defined on H: E[^Q^{x,y). 

Let us consider a Tonelli Hamiltonian H{x,y) where {x,y) G T x M. By adding a 
closed 1-form and a constant, we can assume that its a-function attains its minimum 
at the zero first cohomlogy, i.e. ani^) = mina/f = 0, the Mather set is a singleton of 
hyperbolic fixed point located at the origin (x, y) = (0,0). For Fi > 0, the equation 


(3.16) H{x,W{x,E)) = E, VxGT 

has exactly two global solutions W^{x, E) > W~{x, E). These two solutions produce 
the separatrix and Sq : Eg = graph(VF=*=(-, 0)) if Fi = 0. 

In the region above the separatrix (or below the separatrix Eg ), one can in¬ 
troduce new angle-action variable {I, if) so that H{x,y) = E{I). The coordinate 
transformation {x,y) -A (F, y?) is determined by the generating function S{I,x) which 
solves the Hamilton-Jacobi equation 



Since the Hamiltonian is strictly convex in y, this equation determines exactly two 
global solutions if Fi > mina^, denoted by W~^{x,E) and W~{x,E). The graph of 
W~^{x,E) is above the separatrix E^, the graph of W~{x,E) below the separatrix 

Eg-. 

Let us consider the region above the separatrix E^, where the generating function 

S{I,x) = [ W+{x,E{I))dx, 


produces a coordinate transformation Ti: {x,y) -A {E uniquely determines F) 

(3.17) y^^=W*{^,E), j f.w*(x,E)dx. 

From Section 50 B and C of |A89j . the action variable F = I(x,y) is defined as the 
area bounded by the curve graph(IF'''(-, E)) and the curve 

(3.18) 1(E) = (w^(x,E)-W^(x,0)^dx. 

The symplectic change of coordinates is the composition of these maps 
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It remains to show the twist. We use a result of Carneiro |Ca] saying that Mather’s 
/3 function is differentiable in the radial direction for autonomous systems. Now h{I) 
is actually Mather’s a function since Mather set is exactly the periodic orbit The 
direction of vg is the radial direction as v varies. The a function is strictly convex 

^ > 0 , a.e. in order that f3 is differentiable. 


dh{I) 

dl 


dl Jo dP 



d^h{t) 

dP 


dt > 0. 


Since the symplectic transformation is explicit, we get that h € C^. 


Finally, to see the system h{I) is reversible, we notice that the reversibility of the 
system H{x,y) implies that the Mather sets with rotation vectors ug and —i^g, ly > 0 
are supported on the same periodic orbit with reversed time. Since the Legrendre 
transform of an even function is also even, so we get the Lagrangian L{x, x) is even 
with respect to x, hence P = get a negative sign we reverse the time. The two 
periodic orbits lie on the same energy level and their corresponding action variables are 
opposite to each other from the formula I = § y dx. This completes the proof. □ 


3.6.1. The estimate of the derivatives of the canonical symplectic transformation near 
strong double resonance. When we are getting close to the strong double resonance 
point the symplectic transformation in Proposition |3.1| becomes singular. Let us show 
how that symplectic transformation in the homogenized coordinates behaves when the 
energy of the periodic orbit is getting closer and closer to zero. 


Lemma 3.4. Consider the Hamiltonian H{x,y) in Proposition 3.1. Then, 
level greater than small E > 0, we have the following estimates for the 
transform 4* given by Proposition\3.1\ 


for energy 
symplectic 


(3.19) 


d{x,y) 

9(1, p) 


< di \ InFil, 


(3.20) 


max < 

0 <ki<k [ 

d^x 


d^y 

1 ^ 

dp^^dP~^^ 

1 

dp^^dP~^^ 

j — jyjk-l+S 


where the constant 6 > 0 is small and d^ > 0 is independent of E. 


Remark 3.2. Because of this lemma, we consider only the part of phase space with 
|/| > 7 > 0 for some 7 independent of d,e. Hence we have E = h{'y) > 0. This 7 will 
be chosen in Section 16.31 


Proof. Recall, in the region above the separatrix S)}", new action variable I = I{x,y) 
is introduced as the area bounded by the curve graph(iy'“(-, E)) and the curve 


I{E) = ^j^ (w^{x,E)-W^{x,0)^dx. 

In (/, (/j)-coordinates, the speed of p remains constant and approaches to zero as E 
decreases to zero while the speed of x is bounded away from zero when (x, y) is not 
close to the hyperbolic fixed point. This fact reminds us of the singularity of E{I) as 
i? — 7 > 0. Let us explore how singular it would be. Since the function W~^ solves the 
equation (3.16) for each E > 0, we have 


dE 


1 


f27r o I 

—W+{x,E)dx = — 


r2TT 


dE 


271 


dH 

dy 


-1 


,W+{x,E)'^dx. 
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For each E > 0, dyH = x remains away from zero when (x, y) is not close to the fixed 
point. For small Fi > 0, the main contribution to the quantity ^ is the integral over 
a small neighborhood of x = 0. In a small neighborhood of the hyperbolic fixed point 
the Hamiltonian takes a form 

1 

y) = -y"^ - y x^ + H^{x, y) 

where H^{x,y) = 0(|| (x, y)p). Therefore, in a small neighborhood of the fixed point, 
the energy level line is the graph of the function 

(3.21) y = V^E + A2x2 (^1 + Wi (x)) , x E [-5, 5] 


where IFi(x) = 0(|x|). Consequently, we have 

1 dx 

= - \ lnE + Bi{E) 

7TA 

where Bi depends on 6 and A but remains bounded as -E —0. 




dx 


27r J_s V2.S +A2x2(l + ITi(x)) 


To explore higher order singularity of / on Fi, we do further calculation 

d& 


‘27r 


dH 

dy 


+ 


1 


|^ 2 n 


F2k-2 


dH 

dy 


d^H 

dy"^ 

-1 


(x,IF+(x,.F))dx 


dx 


where = (—l)”“^(2/c — 3){2k — 5) • • • 1, E 2 k -2 is a polynomial of {2k — 2)-order in 
dy^H, its coefficients are the functions of the derivatives of H. To obtain an estimate 
on the F-th derivative of F in E, we make use of again the formula (3.21) and the fact 
that dyH = X remains away from zero when (x, y) is not close to the fixed point. For 
the case k > 2 we find 


(3.23) = Ak{E)E^-\l + Bk{E)) 

where the functions and B^ depends on y and d but remains bounded as Fi —)• 0. 
Indeed, we have 


Ak{E) = 



Guaranteed by the strict monotonicity of F in Fi (see (3.22)), the function E = E{I) 
and its inverse I = I{E) are well defined. The identity E{I{E)) = E yields 


dE dl _ 
~dIdE ^ 


dE 

~dl 


dE] 


-1 


which in turn produces inductively the expression of second derivative 

d^E _ ( d^I 

~dP~~\dE) 


and higher order of derivatives. We thus obtain that 


(3.24) 


dE 


d^E 

< 

d'u 

dl 

|lnF;|’ 

dH 


\\nE\^+^E^-^ 


V F > 2, 
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where the constants > 0 (fc = 1, 2...) are independent of the energy E > 0. 

With these formulae we are ready to estimate the C^-norm of the coordinate trans¬ 
formation T: {x,y) —7- This transformation is determined by the generating 

function S{I,x) which solves the Hamilton-Jacobi equation 



Since the Hamiltonian is strictly convex in y, this equation determines exactly two 
global solutions E > mina^, denoted by W~^{x,E) and W~{x,E). The graph of 
W~^{x,E) is above the separatrix the graph of W~{x,E) below the separatrix 
Sg . These two solutions satisfy an identity 


(3.25) 


dH dW^ 
dy dE 


Let us consider the region above the separatrix where the generating function 


S{I,x) 


W+{x,E{I))dx, 


produces a coordinate transformation Ti: (x,y) —)> (/,</?) {E uniquely determines I) 
determined by formula (3.17). Restricted in the region where Li > 0 we obtain from 
the identity (3.25) that 


didx \ dy J dl 


Therefore, equation (3.17) well defines the coordinate transformation {x,y) 
such that 


(I, 


dtp \dl j \dEdx) ’ 

dx _ (d‘^SdE dSd'^E / d^S \ 

m ~~ [uj j \dEdx J 

dip \dl J dx"^ ’ 

dy _ d‘^S f dE d‘^S 
dl dEdx y dl ^ dx'^ \dl ) j 


Indeed, these equations are derived as follows. As I uniquely determines E, we can 
write the generating function S{I,x) = S{E,x). For the equation p = |j, think x,y 
as the function of I, p and take differential for p on both sides, we have 

d'^S dx d'^S dx dE 
dIdx dp dEdx dp dl' 

From which we obtain 

^ (—Y^ 

dp \dEdx) \dl) 

Take differential for I we have 

d^S UeV dSd^E d'^S dEdx 

^ ~^~dP ^ dEdxHm' 
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from which we obtain the second equation. From the equation y = ^, we obtain 


^ ^ / d‘^S 

dip dx'^ dtp dx'^ \dEdx) \dl 

that is the third equation. By the same method, we can get the fourth equation. With 

in 

□ 


-1 


the help of these formulae as well as (3.24) we hnally obtain the estimate stated in 
the lemma. 


3.7. Reduction of order and undoing the shear near strong double resonance 

in the full system. In this section, we perform the reduction of order near strong 
double resonance in the full system. There is one special thing that we need to do 
is to undo the symplectic transformations © since the matrix S is not necessarily 
unimodular matrix of integer and may destroy the resonance relations. 


Begin with the system (3.5), we apply Proposition 3.1 to (xi,X 2 ,yi,y 2 )-component 


with homology class g = ( 1 , 0 ) to get new coordinates ks{I,‘p,u,v) = (xi,X 2 , yi, y 2 ) so 
that the cylinder takes the form G M x T, {u,v) = 0}. Since {u,v) = 0 on the 

invariant cylinder, where ks is the restriction map in Theorem |3.2[ we have 

x{I,ip,U,v) =X*{I,ip) +Xr{I,V,U,v), 

(3.27) 

y{I, ip, u, v) = y*{I, ip) + yr{I, ip, u, v), 

where |xr|/||(n,u)|| — )• 0 and |y,.|/||(ri,u)|| —>■ 0 as ||(w,u)|| —)• 0. Extending ks in natural 
way to all variables, denoted by ,ip,u,v,Xn- 2 ,yn- 2 ) = (x,y, x„_ 2 , yn- 2 ), we get 


the Hamiltonian .^^145 in the variables {I,u,yn-2', P’,v,Xn-2), where H 5 is in (3.4). 
We have H 5 = (©“^)*H'^' from (3.4), where is in (3.8). The variables {x",Y'') (we 
omit " for them in the following) of undergo the following transformations by .^ 5 © 
(see (3.12) for ©) 


(3.28) 


X 


Xn —2 


Y 

r 

Yn- 2 , 



id2 0 
id„_2 

id 2 —A~^A 
0 idn-2 


x{I, V, ip, u 

^n —2 

y{I,v,ip,u) 

yn -2 


Put Formula (3.28) into the fourth equality of (3.10), notice yn -2 = Yn -2 and apply 


Theorem 3.2 and Proposition |3 .1 1 we get a Hamiltonian 


(3.29) 


=^{Ay{I,ip,u,v),y{I,ip,u,v))) + V{x{I,ip,u,v)) 

1 1 

+ Pl{y{I,^,U,v),yn-2) + -{yn-2, Byn-2) + ^(Wn-2,yn-2) 

+ As{SRi{x) + e'^R//(x,y)) 

=h{I) + ^ {uin-2,yn-2) + A}Pl{I, ip, U, V, yn- 2 ) 

Ye 

+ 2 {yri-2, Byn-2) + A.§{ 6 Ri{x) + e'^Rji{x, y)) 


+ 


^{Ay{I,ip,u,v),y{I,ip,u,v)) - ^{Af{I,ip),y*{I, ip)) 


+ [y{^{I,^,u,v)) -V{P‘{I,ip))]. 


By formula (3.27), the sum of the last four terms in the bracket in above formula, as 


well as its first derivatives, is equal to zero when u = u = 0. By the condition assumed 
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the set 

n = {{I,^p,U,V,kn-2,yn-2 ■ {u,v) = 0} 

is a NHIC for the Hamiltonian flow of ^|H 5 . Restricted on the NHIC, the above 
system is a system of n — 1 degrees of freedom of the following form 


(3.30) 


=h{I) + {(2}n-2,9n-2) + k} Pi{I, if, yn- 2 ) 

+ ^ {yn-2,Byn-2) + k}{6Ri{x) + e'^R 7 /(x,y)) 


where kg is the restriction of .^5 to the NHIC. 

Next let us introduce another symplectic coordinate transformation (undoing the 
transformation ©) since in the current coordinates the system is not necessarily peri¬ 
odic in Xn -2 due to the shear transformation 6 , 


(3.31) 


V’ 

r 

u 

= 

Xn-2. 


J 


V 

= 

<> 

s 

1 

to 

1_ 



id2 0 
id„_2 


id2 —A 
0 idn_2 


u 

Xn-2 

I 

V 

yn -2 


The composition of transformations (3.28) and ( |3.31 ) produces the following relation 

x(( J, v) A~^A\n-2,'i9-, u) 

X„_2-i*i-‘(V^,u)‘ 


(3.32) 


X 


id2 

0 

Xn—2 


idn-2 

Y 


id2 —A~^A 

1 

to 


0 

idn-2 


y((J,v)-hii ^1 y„_ 2,V’, u) 
Yn-2 


Be aware of the fact that x* denotes a closed curve with homological class [xq] = (1,0) 
(see Formula (3.27)), we have 

x(/, ip + l,u,v)= x(/, yj, u, v) + ( 1 , 0 ), y(/, ip + l,u,v)= y(/, u, v), 


we get following from (3.32) 

x{J,ljj + 1 , U, V,X„_2, Y„_2) =x{J,^p,U,V,Xn- 2 ,'^n- 2 ) + ei, 
y(J, ■0 + 1 , U, V,X„_2, Y„_2) =Y(J, V’, U, V,X„_2, Y„_2). 

For k > 3 we also have 

x{J, V’, u, V, Xn-2 + efc, Yn-2) =x{J, V’, u, V, Xn-2, Yn-2) + Cfc, 
U, V,Xn-2 + efc, Yn-2) =Y{J,lj;, U, V, X„_2, Yn-2). 


where efc G M"" is a unit vector, whose k-th entry is equal to one, all other entries are 
equal to zero, efc G is defined in the same way. Consequently, the composition 


of the transformations (3.28) and (3.31) produces a map on T*T' 


Next we restrict to the NHIC H to obtain a Hamiltonian H of one less degree 

of freedom. The restriction is done by setting u = v = 0, which implies 


u = n = 0 , y + {r] 2 , Yn- 2 ) = v = 0, 

if we use rji to denote the i-th row of the matrix A~^A for i = 1,2. We substitute two 
equations back to ©^R^Hg to eliminate u,v from our list of variables. The resulting 
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function denoted by H is still Hamiltonian following from Theorem 3.2 The reduced 
system H has the form 


(3.33) 


H =h(J,Y„_2) + P/(J,V’,Y„_2) 

+ SRl{'ll^,Xn-2, J,Xn-2) + Xn-2, Yn- 2 ), 

where the main part h takes the form 

(3.34) h = h{J + (r7i,Y„_2)) + ^tDn-2)Y„_2^ + - (Yn-2, B'Vn-2 

The perturbation term Rj depends on the new action variable in somehow special way 

R/(V',X„_2, J, Y„_2) = R7(V',Xn-2, J + (r/1, Y„_2)) 

similarly for P/, and we have |R/|r -2 = 0(1)) |R//|r -2 = 

To formalize the above procedure of order reduction, let ns recall the operator 
introdnced at the beginning of Section]^ Denote by 7r_2© the symplectic transforma¬ 
tion obtained from S by applying 7r_2 to both the action and the angular variables, we 
have 7r_2(S“^) = (7r_2©)“^. One can verify that H = ( 7 r_ 26 )*A;|H 5 dne to u = n = 0. 

3.8. Frequency tracking and refinement. Let us recall the transformation for the 


frequency shown in (2.2) 


UJa M'uJa M”M'uJa, 


where M' is dehned in (2.3) and M" is defined implicitly in Lemma 2.2 

Assume at certain a = a" there is the second resonance k" G Z"', i.e. (k", M'uja") = 0 
by Lemma 2.2 For small Ao = a — a", we have 

(3.35) M"M'a;a = M"(a" + Aa,0,^,w;_3y = (o, 0, + k+Aa 

where k^ is the first column of M" . Notice that the second entry o f th e frequency 
oj = M”M'oJa is zero since the second entry of k^ is zero (see Lemma 2.1). 


Next, we apply the transformation 5, which does not change the first two entries 
of the freqnency vector, so the first two entries of the frequency vector SM”M'oja is 
(A:|Aa, 0), where k\ is the first entry of k^. 

there we have chosen the homological class s' = (1,0) G Lfi(T^, Z 


Recall Section 


3.6 


and the Hamiltonian system of one degree of freedom h{I) has periodic orbits achieving 
all freqnency zz(l,0). Therefore, the frequency line SM"M'u)a lies in the image of the 
frequency map of the system o cj) where H o is in 


Proposition 2.2 


Next, we have undone the transformation ©, so in the system has the 

frequency line M"M'oJa in (3.35). Finally, after the reduction of order during which 


we eliminate the second entry 0 from M"M'uja, so in a neighborhood of the frequency 
line Tr- 2 {M"M'uJa), the Hamiltoinan system H is defined. 

After one step of the reduction of order, we are handed with a Hamiltonian system 
of (n—1) degrees of freedom H, for which we need to choose a new path to approximate 
the frequency line (2.2). We further undo the transformation to transform the 


system H to i/ := (7r_25!?t'0*H) where 7r_2501'' is again to apply 7r_2 to both the 
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action and angular part of the symplectic transformation iUf". The new system H : = 
{71-2^')*^ is defined in a /r neighborhood of the frequency line 


(3.36) 


7r_ 2 M'Wa 


(7r_2M"-i)(7r_25-^)7r_2(5M"M'a;„) = Aa(a, 


dp 

qQ 



t 


To relocate the frequency line, notice the matrix M' does not change the fourth 
entry of Ua, we pick a rational number which is /x-close to ul denoted by | satisfying 


(3.37) 


P 

- - W4 

q 


< P, g-c.d.{p,q) = 1 , 


use the line segment of frequency coa ■= Aa(a, |,a>*_ 4 )* G 
the line segment 7r_2M'a;a = Aa(a, ^,w*_ 3 )h 


Dn—1 


to approximate 


2.1 


since c <)*_4 G DC(n —3,a,T). 

to 


This frequency vector has the form of Ua in Section 
We set ~ satisfying g.c.d.{P,Q) = 1 and consider some large K{> K) 

distinguish whether the second resonance is strong or not. Now we are in a situation 
completely parallel to Section 2.1 if we endow a bar to all the notations and get rid 
of one dimension in all the vectors. Again we encounter the situation of single and 
double resonances. The hrst resonance relation is given by k' = —qP,0n-4:)/d 

before acted by M” where d = g.c.d.(gP,p(3). 


As we vary a, double resonance may appear. Some of the double resonance points 
are new while some are inherited from that before the reduction of order. Next, we 
show that a strong double resonance vector before the reduction of order always gives 
rise to a strong double resonance after the reduction of order. 


Lemma 3.5. Each second resonance vector k" with (k",a;(j") = 0 at some a" gives 
rise to a second resonant vector k" := 7r_2(k") := 7r_2(k"M'“^) for the reduced fre- 
queney vector Ua satisfying (k",a)a//) = 0, where a” = a" — — |)- The vector 

k"(7r_2M"-i) = ei = (l,0,...,0). 


Proof. Consider at a double resonance point a" we have 

{k'',Ua") = 0 

for some vector k" G Z"’ \ span{k'} and |k"| < K. We have 

(k"M'-\MW') = 0. 

Next denote by k" := 7 r_ 2 (k"M'“^). Since we have changed 7 r_ 2 M'u;a to COa by chang¬ 
ing the Diophantine number to a rational number |, suppose k" = (fei, ^ 3 ,..., kn), 
then we have 

{k",uja") = 

We then move a" slightly to a” = a” — “ f) ^kat we get (k", uja") = 0. 

Since the first row of M” is k", one has k" = 7 r_ 2 k" = ei. □ 


The total number of double resonance points is finite depending only on K. Most 
of these double resonances are weak. The total number of strong double resonances 
is finite and does not depend on K. 

As for the frequency line reduced in the case of double resonance, we only need to 
convert this approximation, by the map M", to the coordinates under consideration. 
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We keep it in mind that the approximation of u)\ by a rational number ? shown in 


(3.37) applies to all frequency line segments. 


3.9. Normal form after one step of order reduction around strong double 
resonance. The Hamiltonian with n degrees of freedom is reduced to a system with 
one less degree of freedom. To continue, we need to handle three possibilities when 
we are going to reduce the system further to two less degrees of freedom; 


(1) for the Hamiltonian (3.2) inherited from the case of single resonance, if it is 


still at single resonance, we apply Proposition 2.1 


(2) for the Hamiltonian (3.2) inherited from the case of single resonance, if it is a 


newly born double resonance, we apply Proposition 2.2 


(3) for the Hamiltonian inherited from the case of double resonance, it is always 


at double resonance due to Lemma 3.5, so we apply Proposition 2.2 


The resonance sub-manifolds can be defined again for the reduced system, denoted by 
S', S". They are (n — 2) and (n — 3)-dimensional respectively. 


2.2|is applicable. Recall 
centered at 


A number /2 is given by Lemma |2.3[ depending on the size of the remainder <5, such 
that in the neighborhood B{Cda,fi), either Proposition 2.1 or 
we have a covering of the neighborhood H(S",e°') by balls of radius e° 

S", which becomes a covering by balls of radius in the homogenized system. 

We have that S" is a codimension one sub-manifold in S". The covering descends to 
a covering of a neighborhood of S" after the reduction of order. We consider only 
those balls intersecting B(uia,fl) and discard the rest. In each of the remaining balls, 
we perform further KAM iteration and reduction of order. The balls are deformed by 
the linear symplectic transformation for strong double resonance and by 911' for 

single or weak double resonance, but the norms of these transformations are bounded 
by a constant independent of 6 ,S,£. 

Next, we obtain a normal form in a neighborhood of a double resonance point that 
is succeeded from that before the reduction of order. 


We define 
(3.38) 


2 _ \ ( P P P 

I 0.) ~i ~Zi 4 , 

\ ^ Q Q 


as the frequency segment for the system (1.1), where the superscript 2 counts the 


current step of the reduction of order and approximation of frequency vector. We 
have that the frequency vector Wq obtained in the previous section is related to 
through 

TT_2{M'uI) = Oa- 

The frequency has three resonance relations given by k',k" and k'^, where k'^ = 


(7r+2k')M' ^ for a = a" given in Lemma 3.5 


Now we determine the triple resonance sub manifold as 

S'" := {y I (k',a;(y)) = (k",a;(y)) = (k'^a;(y)) = 0} C S". 

with frequency w* G S'" of the 


2.2 


We work on the Hamiltonian (2.19) in Proposition 
form for some a. We have the following normal form. 

Proposition 3.2. For all small J > 0, there exists a symplectic transformation cj) 
defined on {B{u}*, e^))) x which is close to identity in 
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the C'' ^ topology, |(/> —id|r -2 = 0{y/e5 sending the Hamiltonian Ho^ E ^ 

of (2.19) to the following form provided e is sufficiently small 


(3.39) 


H o 0 o 0(x, Y) =hiY*) + ^y) + liAY, Y) + V ((k', x) , (k", x)) 
e ye z 


+ Pi{Y) + 6 V{{k',x),{k",x),{k'^,x)) 

+ MR/(x) + e°'R/7(x, y), 

where the first five terms in the right-hand-side of ( |3.39 ) are the same as that in (2.19) 
of Proposition 3.2 with the same notations, V consists of all the Fourier modes in R 
in (2.19) of Proposition 3.2 m span^jk', k", and the remainder 55Ri contains all 
the remaining Fourier modes in JRi. Moreover, we estimate the norms 

l^lr, \V\r-2j |R/|r-4, |R//|r-4 < 0(1), aS £ <C 5 <C (5 — )• 0. 


Proof. The proof is almost the same as that of Proposition 3.2 so we only point out 
the difference. As uj* satisfies (k',w*) = (k",a;*) = 0, we get 

(k,a;*) = {kM'-\M'uj*) = (7r_2(kM'-i), 7r_2(M'a;*)), V k E Z’", 

since the second entry of M'u* is zero. Due to the choice of the J1 neighborhood, we 
have 7r_2(M'a;*) E B{Cda",h)- So part (2) of Lemma 2.3 is applicable to -n- 2 {M'oj*). 
Hence we get the estimate 

I/k > _ Q • infa Aq _ 

K ;i - 2^+i(gQ)^+i(||M'||oo^)2^+i 

holds V k E Z"'\span{k', k", k'^} with |k| < A' = The proof of Proposition 

goes through with this estimate. 


3.2 


□ 


3.10. Reduction of order around triple resonance. In this section, we perform 
the reduction of order around the triple resonance. 


3.10.1. Linear symplectic transformation. We have defined matrices M' E SL{n,'L). 
We introduce two matrices M 2 , M 2 E SL{n,Z) as follows 


M '2 


id2 0 0 0 

Q Qe 0 

a a 

0 f s 0 

000 id„_4 


= 7r+2M', M'2 


k" 

62 1*^2 02x(n-3) 

*n-3 0(n-3)x2 *(n-3)2 


where the vector k^' E M”', first row of M' 2 ', is defined from the relation k^'M^M' = k", 
O 2 = (0,0), in -3 is an (n — 3) dimensional vector and the * entries are defined as 
follows. In M 2 ', we remove the second and third columns and rows to get a matrix 
of (n — 2) X (n — 2). Then from the resulting first row, we find n — 3 integer vectors 
spanning unit volume with the first row. The matrix M' 2 ' determined in this way is in 
SL{n,Z). We next define 

M = M' 2 'M' 2 M' E S’L(n,Z), 

By definition, the first three rows of M are k",k' and k'^ respectively. The matrix M 
induces a symplectic transformation 

m: {x,Y)^{Mx,M-^Y), iWa = Mul, A = MkM\ 

where oJa has vanishing first three entries if a = a" is such that is at triple 
resonance. We denote Aa = a — o", so we get Ua = coa" + Aoei. 
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By the symplectic transformation one obtains the Hamiltonian defined on 
H {U o (j) o 

( 3 . 40 ) =-h{Y*) + + \{AY,Y) + V {xi,X2) 

eye l 

+ 5V (xi,X 2 ,X 3 ) + PiiY) +55Ri{x) + e°'i? 7 -/(x, H), 

where Pj = similarly for Rj,Rif. The matrix M depends on 5 through k'^ 

but independent of 5. The transformation changes the norms of i?/, Rn- However, 
since we can choose 6, e as small as we wish, the remainders can be arbitrarily small. 


3 . 10 . 2 . Shear transformations. We next introduce the shear transformation as we did 
in Lemma 3.1 to block diagonalize A. Let A, S3 G 5 L(n,M) be defined as follows 

id3 


A = 


A 3 A 3 
Ai A3 


83 = 


^3^3 * 


0 

idn-3 


where ^13,^3, A3 are in and respectively. With the shear matrix 

we introduce a transformation 

63: {x,Y) {S3X,S^^Y) := {x,y), 

which transforms the Hamiltonian into the following system defined on T*T^ x 
H53 :=e3^*im"^*(Ho0o0) 


= -h{Y*) + 


( 3 . 41 ) 


^(^373,73) + H(x) + 6V (X3) 


T 3) Yn— 3 ) T rj (-^SYn—3; Yn—s) 

\/e z 


+ {66&^^*Ri{x) + e'^ 63 -^*i?,,(x, y) + Pi{Sly)) , 

where we denote B3 = {A3 - and X3 = (xi,X2,X3), y3 = (Yi,y2,Y3), 

X = (xi,X2). The norms of the matrices 33,83 depends on 5 but not on 5 . We 
next block diagonalize the quadratic form (^393,93) by introducing one more shear 
transformation 

id2 0 0 

- 33^-1 1 0 

0 0 idn-3 

where 33 = (031,032) G is the vector formed by the entries of A on the third row 
to the left of the diagonal. We can verify that 

i 0 0 


^2 = 


( 3 . 42 ) 


^2 


H 3 0 
0 B 3 


Si = 


0 63 0 

0 0 H3 


where 63 = 033 — 33H ^33. Denoting S = S2S3 and & : (x,y) 1—>• {Sx,S *y), we get 
the following system defined on T*T^ x M2(n-2) 

H 5 ■.=6-^*m-^*{Ho(j)o{i)) 


( 3 . 43 ) 


= lh{Y*) + ^(i9,9) + ^ (x) + |yi + (52-1^3) 

T 3) 9n— 3 ) T 

+ PiiS^y) + 6-1* {66Ri{x) + e^Rn{x,y)) 
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(3.44) 


The frequency line now takes the form 

SMua =SM{u>a" + Aaei) 

= (0, 0, 0, LJa"n- 3 ) + Aok^ + Aa(0, 0, —s, On-s), 

where = (*, 0, 0, in-s) is the first column of ^ 2 - Notice that the frequency line 
has 0 as the second entry. 


3.10.3. Reduction of order. In the above system Hg, we apply Theorem |3.4| with ho¬ 
mology class g = (1,0) and find NHIC in the subsystem G := \{Ay,y) + V (x) since 
the current frequency line (3.44) has zero second entry. Restricted to the NHIC, the 
subsystem G is reduced to a system of one degree of freedom denoted by h(I) in 
action-angle coordinates. We restrict to the region |I| > 7 for some small number 7 
to be determined later. 


Applying Theorem 3.2 one reduces the system to the following 


--I 1 


^ 3 , 2 


(3.45) 


—e h{Y*) + h{I) + ~^y3 + ^ 3 ) 

^ (cj^_3,y^_3) -|- —{B^yn—3^yn—3) 

+ Pi+ {66Ri{I, <y9, Xn-2) + yin- 2 , yn-2)) , 


where Z = kg{S ^*)R is obtained from {S ^*)R by substituting xi{I,(p),X 2 {I,(p). 
Similarly for Pf,R[, Rjj. 

3.10.4. Undoing the shear transformation. Now we get a subsystem h{I) + ^y‘^+5Z of 
two degrees of freedom. However, this system is not defined on but on T*T^ x 
To fix this issue, we can use the undo-shear method in Section 3.7 to get back a system 
defined on T^T”. In this section, we only perform a partial undo-shear transformation 
to get a system defined on T*T^ x 

We introduce the following undo-shear transformation 

1 0 0 


5 = 


G SL{n — 1, ] 


s 1 0 

0 0 id„_3 

where s is the first entry of 33 ^.“^ that is also the minus of the (3,1) entry of S 2 . 

6 : {(p,X‘i,Xn-3-, I,y3,yn-3) ^ {S{ip,X‘i,Xn-3)-, S~\l,y^,yn-3)) 

(3.46) = {ip, Sip -h X3, x„_ 3 ; I - sys, y 3 , yn- 3 ) 

: = (</?,x3;xn-3; J,y3;yn-3)- 

Under this transformation ©, the Hamiltonian becomes H 5 := (S)“^*H 


1 , 


(3.47) 


H 5 :=e ^h{Y*)+ h{J + sy^^) +-h^iy^ + 5Z{J + sy:i,ip,xz - sip) 

+ e-^/\Qn -3 ! Yn—3) T 2 ^'^^Yn— 3 ) Yn—3) 

+ Pl + 66Rl{J + Sy3,p>,X3,Xn-3) + £''Rll{ip,X3\Xn-3', J,y3',yn-3)- 


The same calculation as in (3.32) in Section 3.7 shows that the subsystem in the 
bracket is defined on r*T^. Therefore, the system H 5 is defined on T*T^ x 
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The frequency line now is obtained from (3.44) by removing the second entry 0 (first 
reduction) and replacing the third entry —s by 0 (undo-shear). 


3.10.5. Further reduction of order. Again we need to distinguish the low and high 
energy regions. In the low energy region, the Pi term is 0(\/e) hence is neglected. 


Similar to what we met in Lemma 3.1, for the low energy region, we get a product 


of an integrable system of n — 3 degrees of freedom and a Tonelli system of two degrees 
of freedom 

(3.48) G 3 ((^, X 3 , J, y 3 ) = h{J + sy^) + ^ 637 ! + + ■SYa, 2:3 - s(p). 


This system is perturbed by 55Ri -|- Rn- We will apply the procedure of order 
reduction to this system. Namely, we apply Theorem |3.4| with homology class g = 
(1,0) to get a NHIC and restrict the system to the NHIC to get a system of one degree 
of freedom. We stick to the (1,0) homology class since the current frequency line has 
zero second entry. 


3.10.6. Estimates of and s. We claim that 

(3.49) 63 = constb|k'^p, s = consts|k'^|. 


where the constants are independent of <5, constb > 0 and const* G M. 


Recall the definitions of 63 (see (3.42)) 


h = 0,33 - 33^ 


and s is the first entry of 33 ^“^. The (z,j)-th entry of A = MAM* is niilKrrd- where 
mi,mj are the i-th and j-th rows of M respectively. Since the first three rows of M 
are k", k', k'^ respectively, we get that 


63 = k'2A(k'2)* - (k'2AK*)(KAK*)-^(KA(k'2)*), 


and s is the first entry of k'^AK*(KAK*) where we denote by K the matrix of 2 x n 
whose two rows are k*' and k' respectively. Now s is estimated easily as const.|k'^| 
since AK*(KAK*)“^ does not depend on 6 . 


We focus on 63 in the following. Since A is positive definite, we decompose A = CC* 
for some C G GL(n,M) and denote k'^C := k, KC = K. This gives us 

= k(k* - K*(KK*)-iKk*). 


Now, we recall the Gauss least square method. The equation K*x = k*, though in 
general not solvable for x G M^, we can seek for a least square solution given by 
>^is = (KK*)“^Kk*, which has geometric interpretation as follows. The vector Kx^* = 
K(KK*)-iKk* is the projection of k to the linear space spanned by the column vectors 
of K*. Hence (k* — K*(KK*)“^Kk*) is the projection of k to the orthogonal complement 
of the linear space spanned by the column vectors of K*. We see from the construction 
of the vectors k",k',k'^ that forms a nonzero angle with the plane span{k',k"} 
independent of <5, since as g ^ 0 one has 

k' = {0,Qp/d,-qP/d,6n-4) || (0,1,-|£, 6„_3) ( 0 , 1 ,-W4Q/P, On-3), 

Qp 

from which k'^ = ( 7 r+ 2 k')M'“^ is defined and the matrices K,M',A do not depend 
on 6 . This linear independence relation is preserved by the linear transformation C. 
Hence we get that 63 = c|k'^p for some constant c > 0 and independent of 5. 
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3.11. Further genericity conditions. In this section, we talk about the genericity 
conditions that are needed during the second step of reduction of order. 


3.11.1. Genericity conditions in the regime free of strong double resonances. For the 
frequency vector = Aa(a, associated to the original system (1.1), 

suppose there are only two resonance vectors k', V!^. 

Let Z 2 ((k', x), (k'^, x), y) := Z((k', x), y) + (iZ((k', x), (k'^, x), y) be formed by the 
Fourier modes of P{x,y) in the span^jk', k'^}, where Z' agrees with that in (HI) in 
Section 3.2 We can introduce a matrix M G SL{n,X) whose second and third rows 
are k', k'^to induce the following symplectic transformation 

(x,y) HA (Mx,M“‘y) := (x,y) 

In the new coordinates, the function 

Z 2 ((k',x), (k'2,x),y) ^ Z'(x 2 ,X 3 ,MV) = Z'(x 2 ,MV) +<^^'(x 2 ,X 3 ,MV) 


We introduce the following non degeneracy condition. 

(H 1)2 Uniformally for all y G F(j 2 := the function Z 2 (x 2 ,X 3 ,y) has 

unique non degenerate global maximum, namely, the Hessian of Z 2 {x 2 {y),x^{y),y) in 
(x 2 ,X 3 ) is negative definite, where (x 2 (y), X 3 (y)) is the global maximum, with finitely 
many exceptions for whieh there are two non degenerate global maxima. 


The genericity of this condition (Hl )2 is guaranteed by the following result. 


Theorem 3.5 f |CZ2) i. Assume M is a closed manifold with finite dimensions, G 
C’'(M, M) with r > 4 V C £ [Co; Ci] o,nd Fi^ is Lipschitz in the parameter Then, there 
exists an open-dense set QJ C C'’(M, M) so that for each V € it holds simultaneously 
for all ( G [Co, Cl] l-hat the minimum of Fc_+V is non-degenerate. In fact, given F G QJ 
there are finitely many Q G [Co,Ci] such that + F has only one global minimal 
{maximal) point for C 7^ Ci l-wo global minimal {maximal) point if C = Ci- 


Next, we show that the genericity condition (Hl )2 that is imposed on the original 
Hamiltonian (1.1) descends to a version of (HI) imposed on the reduced system of n—1 
degrees of freedom. As r(j lies in a Os^o{fJ-) neighborhood of r(j 2 , the non-degeneracy 
condition (HI) still holds on the curve F(j 2 by the implicit function theorem. For 
each M*y G r(j 2 , suppose the global maximum of Z' is achieved at X 2 (y) (if there 
are two, we choose either of them). We substitute this to the expression of Z 2 to get 
5Z'(x2(y), X3, M*y) plus a function of y only. For the resulting function of X3, y, we can 
^pply (HI) to show that the global maximum with respect to X 3 are non-degenerate 
for all M*y G r(j 2 generically. Suppose this global maximum is achieved at the point 
X3(y). However, the point (x2(y),X3(y)) does not necessarily achieve the global max 
of Zfi Instead, the implicit function theorem implies that the global maximum of Z 2 
is achieved at a point (x 2 (y), X 3 (y)) which lies in a 0 {6) neighborhood of (x 2 (y), X 3 (y)). 
Indeed we first see that |x 2 (y) — X 2 (y))| = 0{6) since Z 2 is a <5 perturbation of Z', next, 
if we modify X2(y) by 0{6) in Z'(x2(y), X3, M^y), the non-degeneracy implies that the 
global maximum is achieved at a point that is 0 {6) close to X3(y). 


Similar to Section 3.2, the global maximum of Z 2 gives rise to a NHIC of dimension 
2(n — 2) whose normal Lyapunov exponents are ±0(1) and ±0{Vd) as <5, e —)• 0. We 
also get such a NHIC by applying the procedure of order reduction to the system (3.2) 
restricted to the NHIC of dimension 2{n — 1). The two NHIC stays 0{6) close to each 
other by the previous paragraph, hence coincide by the local uniqueness. 
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3.11.2. The genericity condition on new strong double resonance. Suppose for some a, 
there exist three resonance vectors {k', k", k'^} to associated to the original system 
( 1 . 1 ), where k',k" give rise to a strong double resonance for uja before the first step 


of reduction of order. 

We need to impose a new genericity condition on the original Hamiltonian system 
(1.1) to show that the reduced system (3.48) satisfies Theorem 3.4 Here we do not 


talk about the analogues of (H2.1,H2.2) since the spectral gap condition therein is 
satisfied automatically in the following steps of reduction of orders and we stay away 


from the strong double resonance by a distance 7 (see Remark 3.2). Consider the 
bracketed system in (3.41) 


H. = 


X (^ys, ys) + H (k) + (ka) 1, (kg, ys) G T*T^ 


which is obtained from the system (1.1) by homogenization, picking out Fourier modes 
in span^jk', k", and a shear linear transformation which is identity applied to X3. 


(H 2)2 Assume V G (^^(T^,]^) (r > 5) satisfies (H2). For generic V G 
and small enough 8, the action minimizing invariant measures with rotation vector 
1 ^( 1 , 0,0), \v\ G [ 7 , 00 ), V 7 > 0, are supported on one hyperbolic periodic orbits except 
for finitely many v’s with two hyperbolic periodic orbits. These periodic orbits makes 
up finitely many pieces NHICs. 


Proof. The proof of the genericity of the above hypothesis is almost the same as that 
of |CZlj . Here we sketch the main steps and point out the necessary modifications. 

When 8 V = 0, the Hamiltonian H 3 is free of X3. Therefore, one can treat it as a 
system with two degrees of freedom. It follows from |CZ1] that there are finitely many 
pieces of NHIC extending to zz —)• cx). For any rotation vector of the form (=t=, =t=, 0) the 
minimal measure is supported on a manifold which is the product of these NHICs with 
the cylinder T x M 9 (x 3 , ys). This manifold survives small perturbation 8 V, due to the 
theorem of normally hyperbolic invariant manifold. By the upper semi-continuity of 
Mane set with respect to small perturbation and the normal hyperbolicity one can see 
that the minimal measure with rotation vector of the form (*,*, 0 ) is still supported 
on the NHIMs (four-dimensional cylinders) provided 8 V is small. 

Restricted on this four-dimensional cylinder we obtain a reduced Hamiltonian with 
two degrees of freedom. Consequently, each minimal measure with resonant rotation 
vector must be supported on periodic orbits. Back to the original system, the minimal 
measure with rotation vector zz(l,0,0) is supported on periodic orbit. So, we will do 

Step 1. By the standard energetic reduction, we reduce the system H 3 to a system 
of 2.5 degrees of freedom by choosing yi as the new Hamiltonian and —xi as the 
new time. Write the action functional defined on the space of loops with homology 
class (1,0,0) and further restrict to a function Fe{>^ 2 ,'X- 3 ) smooth in X 2 ,X 3 and E by 
minimizing the action among curves with fixed initial and final points (x 2 ,X 3 ). The 
global minimum of F£;(x 2 ,X 3 ) gives rise to the periodic orbits supporting the Mather 
set. We next show the generic non-degeneracy of the global minimum of Fe{>. 2 ,>‘- 3 ) 
for E in an interval [FioiOo) corresponding to the interval of u. 

Step 2. The union of periodic orbits make up a cylinder. We pick a section of the 
periodic orbits with energy in [E^, — d,E^ + d] for suitably small d. Without loss of 
generality, we suppose the section is given by the xi,X 3 torus. 
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Step 3. One way to construct the perturbation is given in |CZ1| , which has straight¬ 
forward higher dimensional generalization. The idea is to straighten the flow in a 
neighborhood of the periodic orbit and construct the perturbation in such a way that 
it is the linear combination of the first several Fourier modes in the disk transversal 
to the periodic orbit. The advantage of the construction in |CZ1] is that the potential 
added to the Lagrangian is also added to the action function explicitly as a function 
of X 2 ,X 3 . 


Step 4- The next and crucial step is to use the smooth dependence on parameter 
E to show that the non-degeneracy of periodic orbits holds for parameters in a in¬ 


terval [E^ — d,E^ -\- d) for suitably small d. This is an application of Theorem 3.5 


Then by taking an open cover of \Eq,E^ for some large Ei, the compactness of the 
interval [Eq,Ei\ gives that only finitely many perturbation is needed, which gives the 
genericity. 


Step 5. Since the Hamiltonian H 3 restricts to a system (3.48) of two degrees of 


freedom and the minimizing periodic orbits are also minimizing periodic orbits for 


the reduced system (3.48), we get that the periodic orbits are also nondegenerate 


for the reduced system (3.48) (a non degenerate positive definite matrix remains non 
degenerate when restricted to a linear subspace). Nondegeneracy implies hyperbolicity 
of periodic orbits following from the same argument as [CZ1| or the proof of Lemma 
3.2 applied to the reduced system (3.48). 


Step 6. To show the uniform hyperbolicity for the energy intervals [i?i,oo) for 
large Ei, we apply the argument of Lemma |3.2| to the reduced system ( |3.48| ) to 
get that the normal hyperbolicity is determined by the hyperbolic fixed point of 
the system -|- 5[Z]{x^) where [Z]{x 3 ) = fji Z(oo, (p,X 3 — sip) dip. To see that 
Z(oo,-,-) is not singular, we recall that Z is obtained from V by restricting to the 
NHIC parametrized by (I, ip) and formed by action minimizing periodic orbits of the 
system (3.13) with homology class (1,0). For high enough energy level, i.e. |/| large 


enough, the action minimizing periodic orbit of the system (3.13) approaches the circle 


(yij y 2 = 0, X 2 = argmax f V (x)dxi) in the norm (See Proposition 3.1 of |C15a| L 
hence (I(x,y), ip(x,y)) —)> (yi,xi) in the norm. Since V does not depend on yi, we 
get Z’s dependence on I is weaker and weaker as |/| gets larger. □ 


4. A HIERARCHY OF CANONICAL TRANSFORMATIONS 

In this section, we summarize the above algorithm of the reduction of order induc¬ 
tively. The contents in this section are purely algebraic without analysis or dynamics. 


4.1. Induction at the single and weak double resonances. We are given a vector 
a )*_3 = (w)!,..., u!*) € DC(n — 2, a, r) and use the notation 

uj*_j := ...,LU*)e DC(n-j + l,a,T), for j = 3,4,..., n - 1. 

Suppose we start with a Hamiltonian system with j = 0,l,...,n — 3, 

(4.1) Hj(xj,yj) = ^(u;*,yj} + ^(Ajyj,yj} + <5jR7j(xj) + Rnj(xj,yj), 

defined in a ball of radius 1 centered at 0 with 

(1) satisfies for |k| < Kj, k G Z"’, and Kj = 
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( 2 ) lies in a fj,j neighborhood of the frequency line 


= Aaj a, 


Pj+2 Pj+3 
Qj+2 ’ qj+3 


1 ^n— 7—3 


vn-j 


(3) and the constant fij is determined from Lemma 2.3 by a constant S that can 


be made as small as we wish. Denote this 6 by Sj. 

(4) We adopt the following convention of labeling entries of a vector 


Vj = (vi,Vj+2,Vj+3,---,Vn) 

for any vector vj € , which may represent xj, yj, oja , etc ., 

(5) when j = 0, = 77 and = E_ Those with j > 0 are determined 

Wj + 1 W Qj + 3 Q 

inductively in the following. The torus Tg is the standard torus M”/Z” and 
that for J > 0 is also determined inductively in the following. 

( 6 ) Aj is positive definite and is independent of 6 j. 


As a varies in an interval, we always have a hrst resonance vector 
kb = (q Qj+‘2Pj+^ <lj+3Pj+2 


(4.2) 


di 


di 




where dj =g.c.d.{qj-\- 3 Pj+ 2 ,Pj+ 3 Qj+ 2 }- For some a, we have a second resonant vector 
denoted by We apply Proposition 2.1 in the case of single resonance to get the 
normal form for arbitrarily small dj+i 


(4.3) 


r 

+ Sj+iRi,j+i{^j) + ef Pii,j+ii^j,yj)- 


This normal form is defined in a ball of radius 1 centered at yt whose corresponding 

frequency in fj,j+i), where /rj+i(<C fij) is determined from 6 j+i by applying 

Lemma 2.3 We introduce Sj as the threshold to distinguish the strong and weak 
double resonance in Section 


We also introduce the frequency vector 

(A A\ j _ \ ( -^2 P3 Pj+3 

^ Q 2 q3 qj+3 ' 


j = 0,1,.. .n - 3 


associated to the original system ( 1 . 1 ) and related to the frequency ooi 
P = 


= 


M' ^(n+2M[ ^. .Tr+2(^M'j_\(n+2U}i 


The integer vector G corresponds to integer vectors \d> G Z” that is perpen¬ 
dicular to Ua. We introduce a function Zj{x,y) consisting of those Fourier modes 
of P{x, y) in spanz{k^,..., k-^}. We next introduce a function Zj(x 2 , X 3 ,..., Xj+i, y) 
which is obtained from Zj by transforming the x variables by a matrix in SL{n, Z) 
whose (j -|- l)-th row is V.P The genericity condition that we need is the following, 
guaranteed by Theorem |3.5[ 

(Hl)^-: Uniformally for all y G F(jj := dh~^{u+), the function ■^)(x 2 , • • • ,Xj_|_i,y) 
has unique non degenerate global maximum, i.e. the Hessian of Zj in (x 2 ,... ,Xj+i) is 
negative definite at (x 2 (y),... ,xt_|_^(y)) which is the global maxima point, with finitely 
many exceptions for which there are two non-degenerate global maxima. 
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We get a unimodular matrix M'- G such that 

P. 


(4.5) M'a;^=(a,0, 


j+3 

Qj+3 


5 ^n—j—3 ) 5 


M' = 


10 0 
0 {Mj)2x2 0 

0 0 idj2_j_3 


The matrix M, and the rational number ^ - 

^i+3 Vj+29j+3 

follows 


are defined inductively as 


Mi 


Pj+2 

r 

Qj+2 


P 3+3 


1 qj+3 J 



Qj+2Pj+Z Pj+2qj+Z 


di 


di 


1 

Pj +2 


r 0 1 


Qj+2 

P 3+3 

= 

P 3+3 

- 

- H+3 - 


Qj+3 


'3 ^3 

where dj = g.c.d. {Qj+ 2 Pj+ 3 , Pj+ 2 Qj+ 3 } and rj, Sj are integers such that 

Sj{Qj+2Pj+3) + rj{Pj+2qj+3) = dj 

which are determined uniquely by the Euclidean algorithm. 

Next we denote the linear symplectic transformation 

■ (xi,yi) ^ (M'xj,M'“Vi) 

7r_291l'' : (xj,yj) ha (7r_2(M'xj),7r_2 {{M')~^yj)) := {xj+i,yj+i) 

so that each of Xj+i and yj+i has n — (j' + 1) components. After the reduction of 
order done in Section 3.2, we get a Hamiltonian system with the same form as (4.1) 

Ne ch( 

Pj + 4: 


while the subscript j —)• j + 1- We choose any rational number satisfying 


(4.6) 

and denote the frequency vector 

(4.7) ui+^ = Xa,j+1 


Qj+A 


— OJ 


J+4 




^3+^ P3+^ 
h /W ’ 
b/i+3 Qj+A 


) ^n—j—A 


»n-(i+l) 


Now we have complete the induction from step j to the step j + 1 in the case of single 
resonance and weak double resonance. 


Proposition 4.1. Assume (Hl)^- for the Hamiltonian system (1.1) along all the fre- 

queney segments Ua at single resonance. We repeat the above procedure for n — 2 steps 
to get a frequency vector 


(4.8) 


(a,—M' ^i7T+2M[ ^ . M'^_\{TT+2U3'f ^)) 

^ q2 <?3 qn' 


where ^ = fi> ^ ^a. is such that alAfsi^a)) = E, and we determine the list of 

y 2 W Q3 y ^ 

^ and a hierarchy of size of neighborhoods g-o ^ ^ ^ Pj-3 the following 

way: 


(1) Once pj is chosen, then is chosen to he any rational number satisfying 
P 3+3 


Q 3+3 


— u 


3+3 


< Pj, i = 0,1,... ,n - 3. 


( 2 ) Once is chosen, we first choose < A(k'.^) applying Lemma 


3.3, where 


is determined by ..., inductively through (4.2), then determine Pj+i 


from 5j+i applying Lemma 


2. 5| with K = . 
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The Hamiltonian system is reduced to a system of two degrees of freedom as follows 

1 1 •r 

(4.9) Hn-2{x,y) = ■^^y2 + i^{Myi^y2),{yi^y2)) + Sn-2^i{xi,X2)+e^(-+^>Rii{x,y), 

where A £ is positive definite, the number lo ^ 0 is independent of e, 6 n- 2 , 

IR/Ic) \ Rii\c ^-23 are bounded as 6 n- 2 ,^ 0 and Sn -2 is as small as we wish. 


In the resulting system (4.9), we will later perform a standard energetic reduction 


to reduce it to a system of 1.5 degrees of freedom, whose time-1 map is a twist map. 

We only remark that how the factor A| is determined. We first determine the 
before any reduction of order in Section 2.1 During each reduction of order, the value 


Aq is modified (see Section 3.8) due to the refinement of the frequency and refined 
normal form. Eventually, after all the reduction of order, the system restricted on 
the NHIC is a system of two degrees of freedom and the frequency vector Ua is the 
rotation vector of the Aubry-Mather sets on the cylinder. Hence the A^ is naturally 
determined by the equation a{J!fis{uJa)) = E, where E is the fixed energy level. 


4.2. Induction around the strong double resonances. In this section, we per¬ 
form the reduction of order around a strong double resonance. A strong double reso¬ 
nance may appear during the j-th step of reduction of order. Without loss of generality 
and for simplicity of notations, we assume we encounter a strong double resonance 
point before any reduction of order. 

The reduction scheme for strong double resonance is different from that in the single 
resonance regime since different Hamiltonian normal forms are used. To match two 
different regimes, we control the frequency vector oJq for each j, such that they have the 
same choice of rational numbers in the two different regimes. After totally n — 2 steps 
of reduction we get n — 2 linearly independent integer vectors k'^,..., G Z” \ {0} 
which determines a curve ujt = ^^,..., G M) as a varies, where A^ 

is such that a{dp{u:'i)) = E for some fixed energy level E. This frequency vector ut 
the rotation vectors of Aubry-Mather sets lying on a NHIC of dimension two (for the 
Poincare return map that is twist) where the NHIC is known to exist. Since the same 
rotation vectors determine the same Aubry-Mather sets, in the overlapping region 
that can be handled as both single and double resonance, the two reduction schemes 
give the same result. For some a, there might be one more integer vector k" with 
norm bounded by a constant independent of any d’s that is perpendicular to uh. 

The procedure is divided into the following steps: KAM normal form, linear sym- 
plectic transformation, shear transformation, the reduction of order and undoing the 
shear. 


4.2.1. The KAM normal form. In Section 3.7, 3.10, we have shown how to perform 
the first and second steps of reduction of order. In the following, we describe the 
procedure inductively. For each j, oji has two integer vectors G perpen¬ 

dicular to it. These integer vectors are transformed to vectors k'', k'^,..., that 
are perpendicular to ui through 


(4.10) k* = < (... (m;_2 {n+2 {Mf_,n+2{yY)))) 

with corresponding superscripts and gives rise to the same k" = k"^ for all j. 
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In the following, we define j = j + 2 G {2, 3,..., n — 1}. Here j counts the time of 
reduction of order and j counts the number of resonance relations. We perform KAM 
iterations to as we did in Proposition |2.2| and |3.2| to get the following normal form 

=^{L,i,Y) + ^{AY,Y) + Pj{Y) 

a:),..., x)) 

i=2 

+ + e'^Rjjiix, Y) 

where dj+i can be chosen as small as we wish, and (52 <^3 (^j are chosen and 

fixed as our inductive hypothesis. 

Our goal in this section is to determine an upper bound for <5j+i and will fix <^j+i 
provided the upper bound is satisfied in order to proceed to determining ^]+2- 


4.2.2. The linear sympleetie transformation. We construct a matrix Mj G SL(n,Z) 
whose first j rows are exactly k'', k'^, k'^,..., Readers can skip the next paragraph 
if s/he is familiar with this fact. 

We denote by Ma = M''M' and M 3 = M"M' 2 M(, Mj = ■ ■ ■ M[) 

where Ml is the enlargement of Ml into a matrix in SL{n,Z), Ml = ( 7 r+ 2 )*~^M', i = 
2,... ,j and M''^^ has the following form 


M 


// _ 

i+i “ 


k"i+i 

d(j+l)xl ^djr+1 X (n—j—2) 

*(n—J —2)xl d(n—J —2)x2 *(n—j—2)^ 


where is defined from the relation k"'^+^M'_(_^ • • • M^M' = k” and the * entries 

are defined as follows. In M'l_|_;^, we remove the second ... (j + l)-th columns and rows 
to get a matrix of {n — j — 1) x (n — j — 1). Then from the resulting first row, we find 
n — j — 2 integer vectors spanning unit volume with the first row. The matrix M^^ 
determined in this way is in 5L(n, Z). We know by construction that the first j rows 
of the matrix Mj are k", k'^, k'^,..., respectively. We relabel these vectors by 
k^,... ,kJ respectively and denote by Kj the matrix in Z^^*^ formed by these vectors 
as rows. 


The matrix Mj induces a sympleetie transformation 

TRj ; {x,y) ^ {MjX,Mr^y). 

We denote Aj = M-^AMj and the frequency vector uf is transformed to M-^Ua = (Oj, w) 
for some non resonant vector w G Then the (i,j)-th entry of Aj is given by 

Oij = k*A(k'^)*. The Hamiltonian (4.11) under the transformation becomes 

1 1 ^ 

+ -(AjT, Y) + J2SiVi{xi,X2,...,Xi) 

i=2 


(4.12) 


+ Pi{Y) + (5j+iR,,Kx) + e^Rjjiix, Y). 


where Pi{Y) = dJt- ^Pi(y), similarly for Rj j and Rjji- The genericity condition that 
we need is the following which can be proved generic following the same argument as 

(H2)2. 
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Assume (H2)j-. For generic Vj +2 £ C"’(T^,M) (r > 5) and sufficiently 
small Sj+2, the action minimizing invariant measures with rotation vector z^(l, 0 ,..., 0 ) 
\v\ G [ 7 , 00 ), V 7 > 0 0 / the system Y) + ^^{^2 ■ ■ ■ ,Xi) are supported 

on one hyperbolic periodic orbits except for finitely many v ’s for which there are two 
minimal hyperbolic periodic orbits. These periodic orbits make up finitely many pieces 
NHICs. 


4.2.3. The shear transformation. We introduce a shear transformation in this section 
to block diagonalize the quadratic form {A-fY.,Y). Let 


ij ij 


I A\ ij 


idj 


0 


^ idn-j 

= ij — i|ij“^ij where ij G ij G and ij G It can be verified 

that = diag(ylj, i?j). We introduce a linear transformation 

6/j : {x,Y) {SpjX,Sfi.Y) := (x,y), 

which transforms the Hamiltonian into the following defined on T*T^ x 

j 




+ '^SiVi{xi,X2,...,Xi) 


(4.13) 


i=2 




+ Pl{Sjjy) + Rjjl{x, y)) , 

where yj = (yi,..., yj) and y^-j = (Vj+i, • • •, Yn)- The norms of Bp Rjj, Rjji depend 
only on (J 2 ) • • •) <^j, but not on 

Next, we further block diagonalize the matrix ylj. We decompose Aj into the form 


yli = 


A a| 
as 033 al 

34 ^44 


••• 

aj ajj 

where A is the first 2 x 2 block, an are the diagonal entries and a* G is a vector 
formed by the i — 1 entries in the i-th row to the left of the diagonal. We next form 
matrices 

idj 0 0 

-aj+ii”^ 1 0 

0 0 idn—i —1 

id 2 0 0 


R = 


(4.14) 


5//J =T2---rj_i = 


-asA 


-1 


1 


0 


0 0 
0 0 

0 0 


—aiAi 


-1 


0 


1 


0 


0 id,2_j j 


0 0 

We denote 

h = an - a*i-_\a* = k* (A - (AKti)(K,_iAKti)-'(KtiA)) (k*)* 
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where Ki_i is a matrix in whose rows are k^,... ,k® We denote the first 

entry of the vector by Si so the first column of S//j is (1, 0, —S3 ,..., —sj, On-j). 

The same Gauss least square argument as in Section 3.10.6| gives us 

(4.15) bi = constft^i|k®p = constb^i|k'®“^p, Sj = consts^i|k®| = consts_i|k'®“^|, 

where const;,^j(> 0) and consts^j do not depend on Si (but might depend on <5^, k < i). 

With these definitions, we diagonalize the matrix A. Let 5j = then 

Sj^j5- = diag{i, 63, • • •, ^j, Bj}. 

We introduce the transformation Sj: (x, y) —>■ (Sjx, Sj~*y) and obtain a Hamiltonian 
defined on T*T‘^ x M2(n-2) 

j 




(iy,y) + 62V2{x) + {SYl.{x^,6n-i)) + jyf) 


(4.16) 


i=3 


+ jjYn—j) + 2 


+ &r^*{Pj{y) + 6;+iRj4x)+e^Rjji{x,Y)). 
where Xj = (xi,... ,Xj) and the hat notation is standard. 


4.2.4. The reduction of order and undo-shear transformation. We perform the order 


reduction to the system (4.16). The general idea is always to separate the first two 


degrees of freedom from the system and apply Theorem 3.4 to find a NHIC in the 
subsystem with homology class g = (1,0) G Hi(T 2,Z). Restricted to the NHIC by 


applying Theorem 3.2, we reduce the number of degrees of freedom by one. We fix the 
homology class g = (1,0) G Hi(T 2, Z) since we have zero as the second component of 
the frequency vector in ( |4.5|) af ter applying Mj. The first two steps of the reduction 
is done in Section 3.7 and 3.10 Since the shear transformation Sj ^ SL{n,'L) and it 
changes the resonance relations, we always need an undo-shear transformation after 
one reduction to get a subsystem of two degrees of freedom defined on T*T‘^ rather 
than T*T^ X Suppose that after the Tth reduction, z = 1, 2,..., j — 1, we obtain 
a Hamiltonian defined on T*T^ x M2(n-i-2) 


(4.17) 


Hjji — ^*-|-l(7j+l) -|- (lj-|_2.^i,i+2(7i+l, Xj^2) T 2 ^*+2yi+2 

T ^ ^ T 5 A:-^i,A;( 7 i+l, Xj_|_ 2 , . . . , Xfc)^ 

k=i-\-3 

“t“ ^ j? Yn—j) j? Yn—j) “1“ ^ 

y S 2 

where Zi^k, Pi,i, Ri,i, Ri,ii are the i-th reduction from Vk,Pi, Ri, Rn respectively. 

Now the bracketed subsystem is only defined on T*T^ x on r*T2. The 

new undo-shear matrix 

■ 1 0 0 

Si= Si +2 1 0 

0 0 idn-i -2 

transform the bracketed term above to 

hiJ^l{Ji+l + Sj+2Yfi+2) + 2^*+2Y?+2 + ^i+2Zi^i+2{Ji+l + 'Sj+2Yi+2; Ti+l^^i+2 ~ 'Sj+2<^i+l); 
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where Jj+i = /j+i — Sj+ 2 y*+i and Xi +2 = 'Sj_|_ 2 ¥^i+i +Xi+ 2 - Now we obtain a system 
of two degrees of freedom defined on T*T‘^ and we can perform the reduction to get 
a system of one degree of freedom denoted by hi^ 2 {Ii+ 2 ), and a system (4.17) with i 
updated to i + 1. We repeat this procedure until i = j — 1. The NHIC obtained by 
applying Theorem 3.4 to the j — 1-th step determines an upper bound for in order 


for Theorem 3.1, 3.2 to be applicable. This completes the induction for j. 


4.3. Reduction around complete resonance. Suppose we have completed all the 
reduction of orders hence determined all the <5’s. We set j = n — 1 in the previous 
section. In this section, we list some final normal forms that will be used for construc¬ 
tion of diffusing orbit in the later sections. As we did in Section |3.5[ we distinguish 
low energy (|y| bounded by constants independent of e but allowed to depend on 6 's) 
and high energy region and restrict our attention to the low energy region. So we can 
absorb the Pj term into the perturbation e^Rji using (2.16). 

Removing the last row and column of A^-i = we get a matrix A G 

GL(n —1,M). The (i, j)-th entry of A is Uij = k*A(k'^)*. With these notations, (n —2) 
steps of KAM iterations lead to (4.11) with j = n — 3 and \=j + 2 = n— 1. Followed 
by a linear symplectic transformation : (x, Y) i—)• (M„_ix, M~\Y) one obtains 


2 \ \i _^ 

(4.18) Hn-l = -^{uJn,Yn) -{An-lY,Y) ^ V)(xi, X2, . . . ,Xi) -b 5nR{x). 

^ i=2 

Slightly different from the previous procedure, here we first perform a standard ener¬ 
getic reduction to reduce it to a system of n — 1/2 degrees of freedom. As / 0 and 
e > 0 is very small, one has the function Yn{x, Xn, v) as the solution of the equation 

Hn-i{x,Xn,y,Ynix,Xn,y)) = E* > 


which takes the form Yn 



where 


(4.19) 


Ys = 


1 

2 


n— 1 

{Ay, y) + '^ SiVi{xi, ...,Xi) + 5nR{x, Xn, y) 
j=2 


where we update the notation x = (xi,..., Xn-i), y = (Ti, • • •, Tn-i)- 


We next apply 611^2 in (4.14) with j = n — 1 to block diagonalize the quadratic 
form A where we remove the last row and column of 811 ^ 2 - The Hamiltonian system 
has the normal form defined on r*T^ x ^ qp! 


(4.20) 


Y, := ej^*^Ys = \{Ay,y) + 52V2{x) 

n-l /, X 

+ ^‘S'//j(xi, 6n-i-l)^ j + SnR, 

i=3 ^ ^ 


where x = (xi,X2),Xi = (xi,...,Xj). Similarly for y,yi. With this Hamiltonian, we 
perform the reduction of order as we did in the previous section. During the i-th step, 
i = 1, 2 ,..., n — 2, we have the following normal form defined on T*T^ x M 2 {n-i- 2 ) 
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after an undo-shear transform 


= 


hi+l{Ji+l + Si+2yi+2) + 2^i+2y'i+2 


(4.21) 


+ ^i+2Zi^i+2{Ji+l + 'Si+2yi+2) a;j+2 — 'Sj+2¥’i-|-l) 

n—1 ^ 

4“ ^ ^ 4“ Xj-|_2, Xj^3 ..., x/j)^-|- 

k=i+3 


Remark 4.1 (Smoothness reqnirement). We need to perform n — 2 steps of order 
rednction. One step of reduction (either for single or for double resonance) makes 
the reduced system lose two derivatives. In each KAM normal form, the Rj term is 
obtained from P{x, y) by taking the high frequency Fourier modes without taking any 
derivatives and the Rjj loses two derivatives. The genericity conditions (Hl)j, (H2)j 
have nothing to do with the Rjj terms hence the smoothness are always gnaranteed 
for C"’-Hamiltonian with r > 2n. The reqnirement of smoothness here coincides with 
the threshold of smoothness so that KAM tori exist. The KAM theorem holds when 
pertnrbation is C'^”'^^-small and recently, it was shown in mu that KAM theory 
does not apply if the perturbation is small only in C'^”“'^-topology. 


4.4. Admissible frequency path. In this section, we provide an algorithm to move 
the second, third ... components of the frequency vector. 


Definition 4.1 (admissible frequency segment). A frequency segment of the form 
■ ■ ■ ,^) is said admissible {up to permutation of entries) for the 
Hamiltonian system H defined in (1.1) which satisfies the generic conditions (Hlj, 
H 2 )j) along the segment, if there exists a hierarchy of numbers 

/xo > /xi > ... > yn-s 

and a Diophantine vector cD *_3 G DC(n — 2, a, r) such that 


( 1 ) 


Pj+3 


— OJ, 


< hj, j = 0,1,... ,re - 3, 


<?j+3 "j+3 

( 2 ) p.j +1 is determined by dj+i through Lemma 


2.3 with K = 5 


- 1/2 


and 5. 


4+1 


< 


A(k'-i) is chosen arbitrarily, where A(k''i) is given by Lemma 3.3 and k'-i is 
determined by using ( |4.2[ ) inductively. 


Definition 4.2 (p-accessibility). For any given p, we say two vectors cu* and are 
p-accessible to each other if a p-neighborhood of oj® can be joined to that of by 
piecewise admissible frequency segments. 


We will show that the accessibility implies the existence of diffusing orbits. In this 
section we prove the following theorem 

Theorem 4.1. Given any two vectors uA, 10 ^ G and any small number p > 0, there 
are two vectors g M** that are accessible to each other and satisfying 

Iw® — AU®| < p, \lv^ — < p. 

To prove this theorem we need the following number theoretical lemma. 

Lemma 4.1. Given any p > 0 ,t > n and any two frequency vectors a;® 7 ! G 
dh~^{h~^{E)), there exist constant a > 0 and two vectors 

= «,..., a;;®), co*f = {ul^,... 
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satisfying the following 


\oj -uj \ < p, 

*f , *f 


— U)*^ 


< P 


and the (n + 1) vectors G DC(n + l,a,r) for all j = 

0,1,2,... ,n. 

Proof. Fix p > 0,T > n. We prove the lemma by induction from j + 1 to j. First, for 

f 

j = n, it is easy to find two Diophantine numbers ujn- Suppose we already have 
that 

(w*+i,...,w”) G DC(n-j + l,a,r). 

We claim that there are numbers w*® and satisfying |a;*® — w® | < p, \oJ*^ ~^j \ P’’ 

and 

DC(n -3 + 2 , a, r) 

for sufficiently small a > 0. Indeed, by assumption we already have 


(1, k„-j+i 


> 


a 


|kn-j| 


V k„_,+i G Z®®-^+i \ {0}. 


We want to show that all those ujj G M which satisfy the condition 


(4.22) 


|((l,a;j,a;„_j), k„_j+2)| > 


a 




-,V (A;,-,k„_,+2) GZ®®-^+2\{0} 


form a p-dense set provided a is small enough. Given k„_j+i, we consider all kj and 
t 


oj'- satisfying 


kjuj] + ((l,w;®_j),k„-j+i) = 0. 

Formula (4.22) is satisfied automatically when kj = 0 so we assume kj ^ 0. In order 
to guarantee (4.22|) we need to remove an interval of measure ,, centered 

at wj so that (4.22) is satisfied for all ujj in the complement for this kj. The total 
measure of these intervals removed when kj ranges over Z is 


E 


2a 


, l^jl (|kn-j| + I^D® 


< 2 


2a 


h x(|k„_j|+x)^ 


dx. 


Next the total measure of these intervals when k„_j ranges over Z®® ^ is 
4a /■“ 4a 


EE 


< V / - 

\kj\{\\in-j\ + \kj\y x{\kn-j\+xY 

4a 


dx 


Jl Jl 


< 
y=xlr 


x{r + x)' 


dx r®® ^ ^ drdS®® ^ 


- 7-1 


4aC' 


n-j-T- 


/■oo 2 


inner 


where the constant C = r^n-j-iyl) sphere S®® The i 

integral converges for large y and has the asymptote logr for r large and y close to 
I/r. Hence the iterated integral can be estimated as 

I 


roo PC 


dydr <2 




(log r + const) dr 


n/r y(l + y) Jl 

where the right-hand-side is convergent since t > n. The assertion above is proven if 
a > 0 is chosen small enough. □ 



























ARNOLD DIFFUSION 


55 


In virtue of this lemma, we are able to prove Theorem 4.1 


Proof of Theorem \4.1\ As our strategy, we start with an admissible frequency segment 
of the form Ua = (a, ..., with numbers 3> ''' such that 


^ 1+3 - 


Pj+3 

Qj+3 


< Pj, j = 0,1,... ,n - 3. 


Next, we move the number a to the point 


Pn+1 


satisfying 


I Pn + 1 


— < fin -2 for some 


Qn+1 ' Qn+1 

number fin- 2 - We want the line segment to a = (021 admissible 

such that we can move 02 to the point satisfying for some 

number fin-i, and so on. 

We need to show how to determine the rational numbers as well as the numbers 

9j+3 

fij for j = 0,1,..., 2n — 3 inductively. We claim: 

for qiven ui*^ and , there are rational numbers — as well as numbers 

/io > /^l • • • > h 2 n -3 such that 


(4.23) 


^i+3 - 
^j+3 - 


Pj+3 


Qj+3 

Pj+3 


Qj+3 


< fij, 

< fij, 


j = 0,1,... ,n - 3 
j = n — 4,1,..., 2n — 3. 


Moreover, the n — 1 vectors 


Uj := 


ttj+i 


Pj+2 Pj+3 


Pn+j \ 

’ / ■ 

Qn+j ' 


j = 0,l,...,n-3 


dj+2 qj+z 

are all admissible. 

This implies the accessibility of the two vectors w** and uj*^. 

Let us first show how to determine the numbers fiQ,fii, then we explain how to 
determine all the remaining numbers inductively. 

For the frequency segment coi = (ai, ..., to be admissible, we first choose 

rational numbers ^ that are p close to w” and a;” respectively. Next we get 
Pq determined by 5o < A(kQ) (where kg is determined by which is in turn 


determined by item (2) of Proposition 4.1 (see also item (2) of Definition 4.1). Next 


^ can be chosen to be any rational number satisfying | ^ — w” | < pi according to 


item (1) of Definition 4.1 


Next, applying the same procedure we determine a number pi ^2 by the integer 
vector k'^ ^ which is determined by using (4.2). We can choose ^ to be any 

rational number satisfying < pip. Here the first subscript 1 of pip,'k'i i is 

inherited from ai meaning the component that is moving and the second subscript is 
the usual subscript for the p counting the number of reduction of order. 

However, we need to guarantee that the second frequency segment U 2 = ( 02 , ^5 
tP++^ to be admissible. For U 2 , we determine a number p 2 p by the integer vector 


9n + l • 


1^2 1 which is determined by ^ using (4.2). We can choose || to be any rational 
number satisfying — w”! < p 2 p. 


To satisfy the above two restrictions simultaneously, we denote p 2 = m.in{pip, p 2 p} 
and choose ^ to be a rational number satisfying |^ \ < p 2 . Once ^ is chosen, 

it is fixed. 
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Inductively, we dete rmine for cjj, i = 1, 2,..., j, j = 1, 2,..., 2n — 3, from item 
(2) of Proposition |4.1 once is given. We define := minj<j{/rjj} and choose 

to be a rational number satisfying (4.23). This completes the proof. □ 


5. Mane sets along s-resonance and around tti-strong resonance 


To construct diffusion orbits, we are going to use variational method closely related 
to the Mather theory. Towards this goal, we need to know sufficient information on 
the structure of relevant Mane sets. 

Recall the procedure of reduction, there are two types of reduction, one is for single 
resonance, another one is for strong double resonance. Except for small neighborhoods 
of finitely many resonant points, along remaining intervals all n — 3 steps of reduction 
are the type of single resonant case. We call it s-resonance (always in single resonance). 
At those points we are encountered strong double resonance for the first time at m-th 
step of reduction, for some m £ {I,- - ■ ,n — 2}. We call this case m-strong resonance, 
as all follow-up reductions are the type of double resonant case. 


5.1. Mane set along s-resonance. As the first step, let us study the normal form 
(2.18) at single resona nce t o see where are t he Mather sets we are concerned about. 

the system (2.18) generically admits a NHIC which 


3.2 


As analyzed in Section 
is diffeomorphic to For the Lagrangian Lh' determined by H', we claim 

that, for each c G (wi, 0, t5n_2), the Mather set stays on Indeed, for 

5 = 0, the Mather set for such c is obviously located on n^_i, which is normally 
hyperbolic. For c G int.if/ 3 ^, (wi, 0, tDri- 2 )) the Mane set is the same as the Mather set. 
Because of upper-semi continuity, for small 5, the Mane set is in a small neighborhood 
of the manifold. On the other hand, due to the hyperbolic structure, any orbit cannot 
stay in a small neighborhood of unless this orbit entirely stays in n^_i. This 

verifies our claim. Restricted on this manifold, the Hamiltonian has (n — l)-degrees 
of freedom. 


Next, let us study the normal form (2.19) at strong double resonance. Under the 
linear transformations 971' and 971" 


H" = (97l"-i97l'-V)*^ + A-^AYn-2), {Y + A-^AYn-2)} + V{x) 

T 7 ,(X'ri— 2 : (A — A^ A ^ AyYn—2) H- l={^n— 2 i Pn—2) 

2 ye 

+ 5R(x,y). 


Let Th" be the Lagrangian determined by H" and assume temporarily 5 = 0. Under 
generic condition (H2.1, H2.2) on V and applying Theorem 3.4 there is a manifold 


which is normally hyperbolic and invariant for and diffeomorphic to T" ^ x 


1° 

n- 

As this system is a direct product of mechanical system and an integrable one, 
for each c G int.if^^„ (cui, 0, 0 )^- 2 ) the Mane set stays in Tbe manifold 

undergoes slight deformation —?■ n^_i for small 5 > 0. For the same reason, the 

Mane set for c G int.if/ 3 ^,, (wi, 0, t!)n-2) also entirely stays in n^_i for small 5 > 0. 


Lemma 5.1. For C'^-Hamiltonian H{x,y) defined on and the first eohomology 

class c* G we assume that the Mather set A4(c*) is supported on a Xg- 

minimal hyperbolic periodic orbit with g G Lli(T^,Z). Then, the set ^^j^{Xg) {Fenchel- 
Legendre transformation of Xg) is an interval {c* + [c_,c+]cg} C with 
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c_ < c+ and ||cg|| = 1 such that for each c G {c* + (c_, c+)cg} we have 

A{c) = M{c*). 

Proof. The proof of this lemma is a variant of Theorem 3.1 of |C12j . As the system is 
autonomous with two degrees of freedom, is either an interval or a point. In 

the case of interval, some Cg G exists such that .^p^{Xg) = {c* + [c_, c+]cg}. 

It follows from |Ms] that for all classes c G {c* + (c_, c+)cg}, the Aubry sets A{c) are 
the same. Let us show that c_ < c+ and A{c) = M.{c*) for c G {c* + (c_,c+)cg}. 

Given any absolutely continuous curve 7 , its Lagrange action is defined as follows 
Acil) = j LhH, 1 ) - he + anic) dt, [gd = c. 

Denote by 70 the hyperbolic periodic orbit, we consider minimal homoclinic orbits to 
7 o, which is located in the intersection of the stable and unstable manifolds of ( 70 ,7o)- 
A homoclinic orbit ( 7 , 7 ) is called minimal if the lift of 7 , 7 : M —?• M is semi-static 
for the class c*, where M is the largest covering space of so that 7 ri(M) = 
holds for each open neighborhood of M.{c*). Because of the topology of T^, there 
are only two types of minimal homoclinic orbits, denoted by ( 7 ^, 7 ^). Given a point 
X G 7 o, there are four sequences of time ^t± such that 7 {tf_) —)■ X as tf_ ±00 

and 7 ^(t^_,_) —> x as tfj^ —>■ ± 00 , tfj^ —)> ±00 and tf_ —>■ ±00 as i —>■ 00 . We define 

Ac( 7 “,x) =liminf / ’ (Lh( 7“>7“) “ (c,7“) + «R'(c) 

i^OQ J j.— \ / 

Ac( 7 +,x) =liminf / (L/^( 7 +, 7 +) - (c, 7 +) + 

2^00 ! + — \ J 

We obviously have Ac* ( 7 ^, x) > 0. Next, we claim that 

Ac* ( 7 +, x) + Ac* ( 7 “, x) > 0 . 

Otherwise, we would have Ac*( 7 ^) = 0 for both ±, which implies that 7 ^ C A{c*). 
However, this violates the graph property of the Aubry set since [y"*"] 7 ^ [ 7 “] in the first 
relative homology group 70 , Z), the projections of 7 ^ on must intersect. The 

contradiction proves our claim. Let us assume Ac*( 7 '’“) > 0 without lose of generality. 

Since the minimal measure for class c* is supported only on the periodic orbit, 
J\f{c*) is composed of those minimal homoclinic orbits along which the action equals 
zero. According to the upper semi-continuity of Marie set in cohomology class, any 
minimal measure /ic is supported by a set lying in a small neighborhood of these 
homoclinic orbits if c = c* -|- Ac and |Ac| is very small. Therefore, it is possible that 
He is supported in a neighborhood of 7 “ only when Ac* ( 7 “) = 0 . 

For a Ac / 0 so that (Ac, [ 70 ]) = 0, we get (Ac, [ 7 ’'']) > 0 and (Ac, [ 7 “]) < 0 by 
changing Ac to —Ac. We claim that the minimal measure gc for c = c* -|- Ac is still 
supported on this periodic orbit. Indeed, if gc is not supported on that periodic orbit, 
its support lies in the small neighborhood of 7 “, it follows that —(Ac, p(//c)) > 0 . 
On the other hand, as the c*-minimal measure is uniquely supported on the periodic 
orbits, the a-function is differentiable at c* and p{gc*) = -^[ 70 ] bold for certain number 
A. Therefore, we have oih{c* + Ac) — anic*) = 0(|Acp). Consequently, we obtain 
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from the definition that 


= j{L h - ric*)dHc + OiH{c* + Ac) - (Ac, pinc)) 

= J {Lh - Tlc*)dpc + anic*) - (Ac, p{pc)) + 0(| Acp), 

from which we have Ac{pc) > 0 as Ac*{pl) > 0, —{Ac,p{pc)) > 0 and 0(|Acp) is a 
higher order term of |Ac|. The contradiction implies that ^p^{Xg) is an interval, not 
a point. □ 


Along the admissible resonance line = A|(a, • 


Pn ' 
q-n ,' 


in (4.8), we encounter 


finitely many strong double resonance points. Once a strong double resonance appear 
during the step of reduction, it continues to be strong double resonance point for 
all the later steps. We remove an interval centered around each strong double 
resonance point. We denote by a" and a" two consecutive strong double resonance 
points. Then we get an interval [a_,a+] satisfying \a-—a'!_\ = |a+—a'_(_| = e”", a < 1/2. 


With the normal form of the Hamiltonian systems in (4.9) and (4.21) after all the 


reduction of orders, we have the following description of the Aubry sets. 

Theorem 5.1 (Description of the Aubry set). Assume (Hl)j,(H2)j for the Hamil¬ 


tonian (1.1) during each step of reduction of order. Along the admissible resonance 


line there are finitely many strong double resonance points we have the following 
description of Aubry sets for the nonautonomous system along the admissible complete 
resonant line cja- 

(1) With finitely many exceptional frequencies where we denote a* for a typ¬ 
ical one, the set a G [a_,a+], is a {n — 1)-dimensional connected set 

that is homeomorphic to [0,1]*^“^. For each c G int.if^(cji), the Aubry set A{c) 
lies in a four dimensional normally hyperbolic invariant cylinders. 

(2) For the finitely many exceptions, and for each c G int.if(3(o;^* ), the Aubry 
set A{c) lies in two distinct four dimensional normally hyperbolic invariant 
cylinders. 

(3) The above two items hold also for a sati sfying \a — a"\ G w here 

la" '■= § 7 ( 7 ) foT 1 defined in Remark 


3.2 and to be determined in Section 


6.3 


Proof. After n — 3 steps of reduction of order, in Proposition 4.1, we arrive at Hamil 


tonian systems of three degrees of freedom possessing a four dimensional normally 
hyperbolic invariant cylinder. We know that the Aubry set of the original system lies 
inside the cylinder. Restricting to the cylinder, picking a Poincae section, we get that 
the time-1 map is a twist map on a cylinder of dimension 2 and the ut is the rotation 
vector of the Aubry-Mather sets of the twist map. The description of the set A^’p{uji,) 
in item (1) follows from repeated application of Lemma 5.1 □ 


5.2. The fiat Fq of the a-function. From now on we study a neighborhood of the 
complete resonance. 


5.2.1. The Hamiltonain systems. Recall the systems (4.19) and (4.20) 

n—1 


Ys = -{Ay,y) -\-'^diVi{xi, ...,Xi)-\- 5nR{x,Xn,y) 
i=2 


(5.1) 
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where x = {xi,.. .,Xn-i), y = {Yi,... ,Fn-i) G ^ and 

n—1 


(5.2) 


:=6 


- 1 * 

11,2 


Ys — {Ay,y) + V2(x) + ( —yf + (5iVj(xj) ) + 6nR, 

1=3 ^ 


defined on T*T^ x x T^, where 

(5.3) Vi(Xi) = f/j (57/(Xj,6n-j-l)) , X = (X1,X2), Xj = (Xi, . . . ,Xj) 

and similarly for y,y*. The matrix Sji G SL{n — 1,M) is given in (|4.14) with j = n — 1 


and the last row and column removed. We let 82 = ^- The system Y^ is defined 
on X and is defined on r*T^ x ^ qp!. The precise form of the 

term 5nR is not involved into the construction of diffusing orbit around the complete 
resonance provided it is small enough, so we do not bother to keep track of it under 
the transformations. 

We introduce the following subsystem of Y^ 


(5.4) 


G(k,y) = \{Ay,y) + V 2 {Y), (k,y) G 

i\2 


In Y^, when —?■ 0, the coefficient bt = const.|k*7 —>• 00 where k* is the i-th resonant 


integer vector depending on 6 i (see ( 3.49[ ) and (4.15) for the estimate of bi and Section 
4.2.1 and 4.2.2 for the definition of k*). To resolve this singular behavior, we make the 
following rescaling to get a new Hamiltonian defined on T*T^ x ^ T^denoted 

by 


(5.5) 


Vbiyi := Yi, ^Xj := Xj, 


f > 3, y = y, i = k. 


n—1 


(5.6) 


Ys = {Ay, y) + ¥ 2 ( 5 ^) + '^{-yf + <5iVi(k, 

i=3 


1 


+ 8 nR, 


where Vi(x, X3, ..., Xj) = Vi(x, v^axs,..., VbiXi). We introduce a matrix 

1 


S = diagjl, ^ 


n— 1 , 


So that Ys and Ys are related via the change of coordinates x = Sx, y = S~^y. The 
auxiliary system Y 5 is easier to study than Ys. In the following, we always study 
system Y 5 first, then transform the results to that of Ys using S. 

However, these transformations may lead to significant change of the norm of the 
potentials, which we estimate in the following. Notice that 5iVi consists of Fourier 



see that only the first i rows of Sjj enters Vj through composing with V). Since the 
estimate of the f-th row of 5// is the same as that of Si = const |k*|, we get that 


i|IC^ — |k;*|2n-4 • 


c 


Since Su in (4.14) is lower triangular with diagonal entries 1, we 


get that the dependence of Vj on Xj is the same as that of Vi on x*. This implies that 

ll<5iV,||c2 < ^ 


|ki|2n-4 • 

So we get that if we let 6 , 
norm. 


0, z > 3, in Y^, the potentials SiYi go to zero in the 
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5.2.2. The a-function and its flat. The a-function ay^ of is defined in the appendix. 
We denote the flat of ay^ by 

Fq := argminay,. 

By Theorem 3.1 of |C12j (Theorem |3.3| here) we know that Fq is an (n —1) dimensional 
convex set. However, this is far from being enough for us to handle the problem of 
crossing m-strong resonance. We need more information about the shape of the flat Fq. 
It is not straightforward to study the flat Fq. Instead, we study it through studying 
the flat for Y 5 and Y 5 . 

As for the a-functions of and Y^, since the Lagrangians are not defined on 
X T, it is not given by the standard definition in Mather theory. We simply 
define the a-function of Y 5 , Y 5 as the linear transformation of ay^ as follows 

aY,(c) := ayflS^jc), aY,(c) := ay,(S*c). 

We denote the corresponding flat for Y 5 and Y 5 by Fq and Fq respectively. 

To study the flats of the a functions, we introduce three systems obtained from 
setting (ij = 0 ,Vi> 3 inY 5 ,Y 5 ,Y 5 respectively. Notice that Yq, Yq are artificial since 
when 6 i —)• 0, we have bi —>■ 00 . However, we artificially set —>■ 0,f > 3 but leave 
bi fixed. We only use these artificially created systems to assist us to understand the 
structure of a functions, but not use them to construct diffusing orbits. The system 
Yq is well defined in topology by taking limit <5* —)• 0, i > 3, in Y^ directly. 

We denote by d : —>■ M the a-function for the G. 

Lemma 5.2. For the systems Yq, Yq, Yq, we have the following expressions for their 
a functions. 

n-l ^ ^ 

aYo(c) = d(c)-F ^ aYo(c) = d(c)-F-||c|||, ay^ = 5'7/*aYo = S"**aYo- 


Proof. We only need to prove the expression for ay^ and obtain the other two by linear 
transformations. From its definition 


-ayg (c) = 


(5.7) 


inf f L{x, x) dpi = inf f {A-^x, x) + V2{x) - (c, x) dp 
= inf y(S' 7 /A“^S' 7 /x,x) -F V2{x) - (c,x) dp 

= inf / (i“^x, x) -F V2{x) - (c, x) -F ^ ^x^ - (c, x) dp 
^ i=3 

r ~ . . . f 

= inf J (A“^x, x) -F V2{x) - (c, x) dp -F inf J ^2 ~ 


n—1 


= «(c) + wc: 


i=3 


^2 
2 * 


where in the second equality, we use Sjjx = x, SjjC = c = (c, c) while leaving x 
unchanged. Going back to the c variables, we get the expression of ay^. □ 


First we get that the flat Fq and Fq for the system Yq, Yq are a two-dimensional 
disk Fq X {c = 0}, where Fq is the flat of d. 
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The next lemma describes the flat Fq of the system Y 5 . It is not hard to get a 
description of the flat Fq and Fq after linear transformations. 

Proposition 5.1. For C”', r > 2 generic Vi, i = 2,... ,n — 1, the flat Fq of the 
systemYs is (n—1) dimensional and entirely stays in 0{y/5f)-neighborhood of the disk 
Fq X {c = 6 }. More precisely, the width of the cube in the Cj direction is 0{y/5i), i = 
3, 4,..., n — 1. 


Proof. The fact that Fq is n— 1 dimensional is given by Theorem 3.1 of [C12| (Theorem 
|3.3| here). Since we have |1(5 — lo| < we have (cf. [Cllj l 

|ay,(c) - aY,{c)\ < 63 , V c G 
After linear transformations, this gives 


(5.8) |aY 5 (c) - aYo(c)| < 

Since we have aYo(c) = d(c) + ^||c|||^, we get that 

aYo(c,c) > if |c| > K^/^. 


As a is non-negative, it follows from Formula (5.8) that aYg{c,c) > {\K^ 


due to Formula (5.8), we have minaY^ < ^ 3 - Therefore, 

OYs (c, c) > min oys , 


if 


1 ) 53 . Also 


This completes the proof for the 0 (^ 63 ) estimate. To get the 0{V5i) estimate for 
i > 3, we only need to set 6j =0, j > i while keeping 5j > 0, j < i and repeat the 
above argument. □ 


Therefore, the flat Fq looks like a pizza, horizontal in the direction of c with small 
thickness of order 0 (-v/^). 


5.3. The channel connected to Fq. Our diffusing orbit is constructed along a NHIC 
of dimension 4 (for the Hamiltonian flow, which can be reduced to dimension 2 for the 
Poincare map) until we get close to a complete resonance. T his N HIC corresponds to 
a channel in H^{ 
channel. 


pn—1 


according to item (1) of Theorem 5.1 Let us describe this 


In the KAM normal form ( |4.11 ) with j = n — 3 and j=j-F2 = n — l,we have the 
frequency segment Ua = xt{a,^,... ,^) as the rotation vector of the Aubry-Mather 
set on a NHIC of dimension 4 (for the Hamiltonian flow, which can be reduced to 
dimension 2 for the Poincare map). At complete resonance a is also rational. Applying 


the symplectic transformation 20ln-i in Section 4.2.2 and performing the energetic 


reduction, we obtain for the system Ys the frequency Aei along the cylinder and A = 0 
corresponds to the complete resonance. 


We define the channel C associated to the system Ys as follows 
(5.9) C = lJ^/3^,(Aei). 

A^O 

We want to see how this channel is connected to the flat Fq. The channel C transforms 
to that associated to the system Y^, Y 5 via linear transformations Sjj, S* respectively. 
We have the following description of the channel for the system Y^. Again it is not 
hard to get the corresponding description for the system Ys after the linear transfor¬ 
mation S*. 
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Lemma 5.3. The channel associated to the system is a neighborhood of 

Sn—l 


Vbs 


Vbn-l 


X 


where C = ■^ii^{X,Q) is a two dimensional channel for the system G correspond¬ 

ing to the NHIC obtained by Theorem 3.4 with homology class g = (1,0). The size of 
the neighborhood projected to the Cj component has width Moreover, the flat as 
well as the two arms of the channel with A > 0 and A < 0 respectively are centrally 
symmetric as —)■ 0. 


Proof. Again we look at system Y 5 and Y 5 . In system Y^, the frequency becomes 


SnXei = A(l, 0, S 3 ,, s^-i). 


In system Y 5 , the frequency is 


SAei = A(^1,0, 


■§3 gn-l \ 

Vfs""' Vbn-J' 


Next, we set <5* = 0, i > 3 in Y 5 to work with the system Yq. Performing the Legendre 
transform we get the following two dimensional channel for Yq 




a). 

Vbn-l ^ 


When we turn on the potentials in Y^ and Y^, this channel becomes (n — 1) dimen¬ 
sional according to Theorem |5.1[ Applying the same argument of Proposition |5.1| to 
the function — (Aei, c) instead of uys we get that for all c G C, there exists A 

such that 


C,; - 


^ A < i > 3. 


Vb; 


So we get the description of the channel associated to the system Y^. When trans¬ 
formed to the system Y^, the channel is a neighborhood of (C, ||A,..., 5 ^^A), whose 

projection to the c* component has width s/ZiJbi. 

Moreover, when we set <5n = 0 in all the three systems Y^, we get that the 

a function is an even function hence the flat Fq, Fq, Fq as well as the two arms of the 
channel for A > 0 and A < 0 are centrally symmetric. □ 


6 . How TO CROSS m-STRONG RESONANCES 


6.1. The difficulty and the strategy. For nearly integrable systems, it is unavoid¬ 
able to cross multiple-strong resonance if we want to find global diffusion orbits. 

As it is unclear to us if the channel C is connected to the flat Fq or not, we avoid 
entering the flat along channel. Instead, th e way we find stays away from the m-strong 
resonance by a distance (see Remark 3.2) and turns around the flat so that two 


channels are connected. It is for two reasons: the singular behavior (Lemma 3.4) of 


the action-angle coordinates, and the lack of the regularity of the Hamiltonian system 


restricted on the NHIC close to the hyperbolic fixed point ap plyin g Theorem 3.2 which 
gives us only a Hamiltonian since lnA/| ln/r| in Theorem 3.2 can be close to one. 

For three degrees of freedom system, the first named author discovered in [C12] a 
path of c-equivalence connecting to arms of the channel turning around but without 
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entering the flat Fq. This c-equivalence mechanism is essentially a diffusing mechanism 
for systems of two degrees of freedom close to the hyperbolic fixed point. 

When we apply the mechanism to systems with more than three degrees of freedom, 
we find that the path in (T^~^,'R.) has constant last n—3 components c for Ys, Ys, Y 5 
since the upper triangular matrices Sjj, preserves the constancy of the last n — 3 
entries. If two channels are connected to the flat with different “hight”, they can not 
be connected by a path of c-equivalence. 

For instance, if we move the frequency from Aei —)• — Aei, correspondingly, we send 
c = (c, c) —)• —c = (—c, —c). Due to the central symmetry, it is allowed that the image 
lies in a neighborhood of — c provided the neighborhood lies entirely in the channel C. 
We consider the system Y 5 for which we need to send c G C to a neighborhood in C of 
—c. A new difficulty aries (see Figure]^. The two arms of the C of s-resonance reach 
the flat Fq at different “height” due to the symmetry, and they are not connected by 
path of cohomology equivalence used in |C12j . along which the quantity c remains 
unchanged (contour line). Our strategy consists of two steps. The first step is to 
perform the cohomology equivalence by holding c = c* constant and sending the 
first two components c —)• — c (the red horizontal curve in Figure]^. So we get the 
cohomology (— c, c*) after the cohomology equivalence procedure. The second step 
is to send c* to —c*. To achieve this, we construct a short transition chain connect 
one arm of the channel channel to a contour line which intersects the other arm of 
the channel (the vertical curve in Figure]^. It looks like a ladder. So, starting from 
one NHIC we pass an m-strong resonance by following a curve of c-equivalence and 
climbing up to another NHIC by the ladder. 


6.2. Cohomology equivalence around m-strong resonance. We assume that 
the strong double resonance is encountered at the first step of reduction, i.e. the 
(n — 2 )-strong resonance, other m-strong resonances are treated similarly. 

Theorem 6.1 (Theorem 5.1 of |C12j ). Let Lq = 2 (A“^x,x) -|-l^(x) be the Lagrangian 
corresponding to the Hamiltonian G(x, y) via Legendre transform. A residual set QJ C 
(^^(T^jM) exists such that for each V 2 £ T1 and for each c G dFo, where Fq is the flat 
of the a function for G, the Mane set does not cover the whole configuration space T^. 


2 ], one can find the 
we get a flat Fq 


3.3 


Sketch of the proof It is actually one of the main results in [C 
more detailed proof in Section 3 of [C15b| . Applying Theorem 
of the a-function which is a 2-dimensional convex disk in ff^(T^,M). Let dFo be the 
boundary of Fq. The key step is showing that, under the hypothesis (H2.1), there are 
at most four classes on dFo with which the Mane set covers the whole configuration 
space T^. The rest of proof is easier, with extra perturbation, these four Mane sets 
do not cover any more. □ 


Guaranteed by the upper semi-continuity of the Mane set, we obtain the description 
of the structure of the Mane set extends to energy levels slightly higher than mind. 

Proposition 6.1. Given C"”, r > 2, generic V 2 , some positive numbers Aq > 0 exist, 
depending on V 2 , so that for each E G (0, Aq) and each c G a~^{E) there exists a circle 
Se C so that all c-semi static curves of L^ pass through that circle transversally 
and 

M^{c)nYc c 

where Lc^i C Sg are closed intervals, disjoint to each other. 
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Proof. By the upper-semi continuity of Mane set in first cohomology and the com¬ 
pactness of OFq, ceratin number Aq > exists such that Af{c) C if c G d ^{E) with 
E < Aq. 

The rotation vector of each minimal measure //£ is non-zero for any c G d~^(E) if 
E > 0. As the configuration space is 2-torus, all c-semi static curves have to turn 
around the torus in the same direction. Therefore, 3 a circle intersects these curves 
topologically transversal. Since the Mane set does not cover the 2-torus, the restriction 
of the Mane set on this circle does not make up the whole circle. □ 


A cohomology class c = (c, c) for the system is transformed to c = (c, c) that of 
via c = Sf/c. Next, a cohomology class c = (c, c) for is related to c = (c, c) for 
via a rescaling in (5.5). Since Sf/ is upper triangular, a constant c* is transformed 
to a constant c*. Namely, the constancy of the last (n — 3) entries of the cohomology 
classes is preserved by the Sf/ transform. 

Recall that aYo(c) = d(c) -|- 5 ||c||^^ in (5.2). Since we have Y^ — Yq = there 

exists some constant d > 0 such that 


(6.1) — dds < o;Ys ~ ckYq — ^ 63 . 

When we look at the energy level E with E — 5||c|||^ G {dds, Aq — the subsystem 
G in Y 5 has energy in (0, Aq). 

Corollary 6.1. For any energy E with E — ^ {dS 3 ,Ao — diJs), any (c, c*) 

so that aYj(c,c*) = E and for sufficiently small ^3 » ^4 S> ... S> <5^ > 0, all semi 
static curves of Ly^ with cohomology class c = (c, c*) {related to c(c, c*) via linear 
transformations) pass transversally through the section Sg x {x G 

(6.2) My,{c,c*) n (Se X {x G C |Jlc,i X {x G 


Proof. Let us consider the Mane set My^{c,c*) of > 5 , lift the last n — 3 components 
of points in the set to the universal covering space After the linear trans¬ 

formation relating Yg and Y 5 , the Mane set J\fYs{c,c*) is sent to an invariant set of 
Y^ denoted by Mys{c,c*) which has upper-semi-continuity as dj —>■ 0, i > 3 in the 
Lagrangian system Lyg. We have shown in Section 5.2.1 that the potentials djVj in 
Y^ converge to zero in the topology as <5* —)■ 0, i > 3. 

As the Hamiltonian flow is the direct product of <1>| and an integrable system. 


the Mane set A/yo(c, c*) of Yq is given by Mq{c) x {c*} x 
sufficiently small ds S> ^4 S> ...!§> dn > 0 , 


AAY,(c,c*)n cj/a 


on —3 


on—3 


So, we get that for 


Noticing that the matrix Sjj is lower triangular and is id 2 in the first 2x2 block, 
we get that the projection of the two sets Mys{c,c*) and My^{c,c*) to the first 
factor are the same. For the same reason, the section Sc C remains unchanged 
under the linear transformations. This completes the proof. □ 


The set rYo(c*, E) = {c£ M) : c = c*, aYo(c) = E} is homeomorphic to a 

circle in the plane id^(T^,M) x {c = c*}, encircling the disk Fq x {cfc = c*}. For any 
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c, c' G rYo(c*,£^), we have aYo(c) = aYo(cO an d c — c' = 0. Extending this property 
to the Lagrangian determined by defined in (4.19), for E > 5 ||c*|||^ + dS^, the set 


(6.3) 


rY,(c*,E) = {cGi/HT 




= c*. a-vJc) = E'i 


: c = c 


is a circle located in the 2-dimensional plane i7^(T^, M) x {c = c*}, encircling the disk 
Fq X {c = c*}. For any two classes c,c' on the circle, one has 


(6.4) 


c-c' = 0, aYsic) = aYsic'). 


We define 
(6.5) 


A(Ao,E)= U rY,(c*,E) 

a(c*)6[0,B] 

EG[dSg,AQ] 


which is an (n — l)-dimensional annulus. Each circle rY. 5 (c*,E) is transformed to a 
circle ry^(c*,E) for the system Ys after a linear transformation given by Sji and a 
rescaling, hence determines a circle in M) (See Formula (4.18) for the definition 

of Hn-i)- We next define 


rH„_i(c*,E) 


{(c,c,c„) : (c,c) G rY^(c*,E), Cn = aYsic,c*)}. 


Due to the following theorem, aH„_iic,Cn) = E* holds for all (c, c„) G rH„_iic* , E). 


Theorem 6.2 (Theorem 3.4 of |C12| 1. For a Hamiltonian H{x,y,Xn,yn) we assume 
that dy^H ^ 0 on {H~^iE)} n {y„ G [yn^vi]}- ^ct yn = Ysix,y,T) be the solution of 
H = E (t = —Xn)- Let an and avs be the a-function for H and Ys respectively, then 
for aYsic) G [yn,yt] we have ic,aYsic)) G a^^iE). 


The c-equivalence to be set up is in the new version introduced in m- The concept 
of the equivalence was first introduced in |M93] . bnt it does not apply to interesting 
problems of autonomous system. We call Sc non-degenerately embedded (n — 1)- 
dimensional torus by assuming a smooth injection cp: —)• T” such that Sc is the 

image of ip, and the induced map //i(T”'“^,Z) ifi(T'^,Z) is an injection. 

Let C C Lf^(T”,M) be a connected set where we are going to define c-equivalence. 
For each class c G Cl, we assume that there exists a non-degenerate embedded (n — 1)- 
dimensional torus Sc C T” such that each c-semi static curve 7 transversally intersects 
Sc. Let 

Vc = ^{iu*HiiU, M) : is a neighborhood of^(c) n Sc in T”}, 

u 

here iu\ U ^ M denotes inclusion map. Vj: is defined to be the annihilator of Vc, 
i.e. if c' G iL^(T"', M), then c' G V;f if and only if (c', /i) = 0 for all h G Vc. Clearly, 

V^ = |^{ker : [7 is a neighborhood ofA7(c) n Sc in T""}. 

u 

Note that there exists a neighborhood U of A7(c) n Sc such that Vc = iu*HiiU,"E.) 
and V;f = kerf^ (see [M93] 1. 

Definition 6.1 (c-eqnivalence). We say that c,c' G iL^(M,M) are cohomologically 
equivalent if there exists a continuous curve T: [0,1] —)> Cl such that r(0) = c, r(l) = c', 
Q:(r(s)) keeps constant for all s G [0,1], and for each sq G [0,1] there exists e > 0 such 
that r(s) — r(so) G '^r{so) whenever s G [0,1] and |s — sqI < c. 














66 


CHONG-QING CHENG AND JINXIN XUE 


Lemma 6.1. For C^-generic V 2 , there exists Aq > 0 as well as suitably small numbers 
£0 > 0, (53 3> (54 3> • • • » > 0, such that for each e G (0, eo)? o^ny two points on the 

circle TE) are c-equivalent provided 



■'n—3 


l|2 

1^2 


G {dSs, Aq — dds). 


{see the paragraph before Corollary 6.1 for the relation between c* and 


c. 


Proof. We extend the configuration space from T” ^ to the extra dimension is for 
Xji- Due to Theorem |6.2| and Corollary |6.1[ we have 


C |J/c,i X |(x,Xn) G T*" 


2ll^n-3ll^2 ^ 


where /c,i is given in Proposition 6.1 with (c, c*) G aY^ {E) and E — ^ 

{663, Aq — dSs) 

In the configuration space we obtain a section S 5 x Obviously, the first 

homology of O (Sg x is at most (n — 2 )-dimensional: 

Vc = span(e 3 ,e 4 , • • • , 6 ^) with c = (c, c, (ay^(c, c)). 

For any two classes c,c' G THn-i{c*,E) with E — 5 l|c*_ 3|||2 G ((i( 53 , Aq), one can see 
the condition (6.4) holds, namely, c — c' ^ V^. This proves our claim. □ 


Remark 6.1. Let us explain how the orbits constructed via cohomology equivalence 
look like in the phase space. Let us consider Hamiltonian systems of two degrees of 
freedom of the form H{x, y) = \^{y, Ay) + V{x), {x, y) G T*T^. On the zeroth energy 
level we assume that there is one unique hyperbolic fixed point, and for simplicity, that 
there are three edges on the boundary of flat of the a function corresponding to three 
homoclinic orbits. In a neighborhood of the hyperbolic fixed point where no KAM tori 
exists, we expect to get small oscillations of the variable y in a controlled way. Indeed, 
consider energy levels slightly higher than zero, there are three hyperbolic periodic 
orbits close to the corresponding three homoclinic orbits. Moreover, the three periodic 
orbits all get close to the hyperbolic fixed point as the energy decreases to zero. For 
sufficiently low energy, the stable manifold of one periodic orbit intersects transversally 
the unstable manifold of another and vice versa (H2). In this setting, the Mane set 
does not cover the whole torus so that we get a path in the M) which is the level 

set of the a-function along which all the cohomology classes are equivalent. In the 
phase space by the A-lemma, we get orbits moving from one periodic orbit to another. 
Actually, something even stronger can be done. If we label the three periodic orbits 
by 1, 2, 3, then for all prescribed bi-infinite symbolic sequence in {1, 2, 3}^, there is an 
orbit visiting the three periodic orbits according to the given sequence ( |GT1[ |L] ). 


6.3. Overlapping property. We are going to show that the NHIC after all the n — 2 


steps of reduction enters the annulus A{Aq,E) (see Formula (6.5)). 


For the system G defined in (3.4), there exists an annulus A of width Aq surrounding 
the flat Fq, see Corollary (6.1), which admits a foliation of c-equivalent circles. We 


claim that the NHIC enters the annulus region A in the sense that the NHIC contains 
some Mather sets for those first cohomology classes lying in A. Indeed, we first k now 
the value Aq for the system G before further reduction of order. Then, Proposition 3.1 


is applied to introduce the canonical action-angle variables restricted on the cylinder 
all the way to the homoclinic orbit (all the way to the cIF in the R^(T"', M)). In virtue 
of the estimate of the derivatives of the canonical transformation in Lemma 13.41 we 
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choose the lower bound 7 in Proposition 3.2 and Remark 3.2 such that the energy 
level /i = 7 < Ao /2 and fix it as the lowest allowable energy level for us to do the 
reduction of order. This 7 is chosen to be independent of any of <5*, i > 3. 


As all periodic orbits in such a cylinder share the same class, e.g. g G ffi(T^,Z), 
and each of them support a minimal measure for certain cohomology class, all of these 
classes make up a channel in which is denoted by Wg = Uxyo^i 3 ^{Xg). 

Because of Lemma |5.1t it has certain width which may vary with A > 0. When we 
turn on the 5i, i >3 perturbations in Y 5 , the channels may become disjoint from the 
disc. However we have the following 


Proposition 6.2 (Overlap property). Consider the systemXds- There exists a positive 
number 5* > 0 such that both arms of the channel C intersects the annulus-shaped 
region: C n A(Ao, E) ^ 0 provided 0 < 63 < 5^,. 


Proof. For one step of order reduction, one has Hamiltonian of the form (3.30) where 
h is well-defined provided /i > 7 . Applying the result in ICZII o ne obtains NHICs in 
{h > 7 } if one ignores the term kg{5Ri{x) +e^Rjj{x,y)) in (3.30). Applying Theorem 
of normally hyperbolic invariant manifold one obtains overflow NHICs, restricted on 
which the system turns out to be nearly integrable with two less degrees of freedom 
after one introduces new action-angle variable again. By applying a result of |BLZ| 
on overflow NHICs, for sufficiently small 62 ^ 64 ^ ^ dn-i S> e > 0, a three- 

dimensional NHIC exist which contains Aubry set for those c G C such that a{c) > 
7 + 2 (Ao — 7 ) (similar to the proof of Proposition 5.1 of |C15aj L □ 


6.4. Construction of the ladder. 

outlined in Section 


6.1 


We have completed the first step of the strategy 


The aYs is a 63 perturbation of oyo = d(c) -|- |||c|p. We first 
move along one arm of the channel C until the energy of the subsystem G becomes in 
(dfis, Ao — d 63 ) at which time we fix c = c* and apply the c-equivalence mechanism 
in the previous section to move c —>• —c. We remark that it is enough to send c into 
a small 053 -^ 0 ( 1 ) neighborhood of — c so that the image lies in C. In this section, we 
work on the second step in the strategy to build the vertical curve in Figure that 
we call ladder. 


In the system Y 5 , we set <^4 = (js = ... = (5^ = 0 to obtain the following system by 
discarding the integrable part 

( 6 . 6 ) Y3^s3 = ^(-4y,y) + ^2(x) + 63Y3{x3) 

defined on T*T'^ x M^, where x = (xi,X 2 ), y = (yi,y2), X 3 = (xi,X 2 ,X 3 ), y 3 = 
(yi)y 2 )y 3 )- The system Y 3 ^S 3 is obtained from 

(6.7) Y3^s3 = + V 2 {x) -k53V)5(x3) 

defined on T*T^ via the following linear transformation 




53 2/3 = ya, 


83^73 = X 3 , 

where S 3 is the first 3x3 block of S. (See the bracketed terms in (3.41) in Section 


3.10 and (4.13) with j = 3 in Section |4 .2.3[). Next, we perform the reduction of order 


in the subsystem G = 5 (Ay, y) -|- V 2 (x) in Y 3^53 to get a system defined on x ! 


62,53 = HI ) + 2^3 + HZi , 3 { I , P , X3) 


(6.8) 
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where ^ 1^3 is obtained from V 3 by restricting to the NHIC. 

We use the linear transformation S 3 (<y 9 , X 3 ) := {ip,xs), ys) := (J, 2 / 3 ) where 


83 = 


S3 

\d)3 


0 

1 

vd)3 


to get a system defined on T*T^ 


(6.9) 


G 2,53 = HJ + ^ 2 / 3 ) + x2/3 + ^3^1,+ ^=-2/3, V 63 X 3 - S 3 <p) 


S3 


Vb-. 


Vbs 


To see that the resulting system is defined on T*T^, it is enough to notice that 02,83 
is a rescaling of the bracketed term in (4.17), which is defined on T*T^. 

Next, let us look at the effect of the reduction on the a function and the cohomology 
classes. Consider a for the system G first. It has a channel C corresponding to the 
NHIC in the phase space with homology class (1,0) (see Figure [^. The NHIC is 
formed by periodic orbits on different energy levels of G. On each energy level there 
is only one periodic orbits except for finitely many energy levels there are two. Each 
hyperbolic periodic orbits corresponds to an interval in 77^(T^, M) according to Lemma 


5.1 hence the channel is foliated into intervals lying on different energy levels of the 


a. We can introduce new coordinates cy, c_l to reparametrize the channel C such that 
d(c||,c_L) = ci(cj|,c_L) and d(c||,c_L) 7 ^ d(c||,c'_j_) for c_l / c'_j_. Namely cy parametrizes 
the short interval on an energy level and c± parametrizes the energy levels. 

Recall that we restrict to the NHIC in the system G to get a system of one degree of 
freedom that is integrable. The reduced Hamiltonian system h{I) is its own a function 
for integrable systems. This reduction procedure amounts to identify Q:(cy, c_l) = h(l') 
and / = c_L. 

The a functions ay-g ^ are defined in the standard way and we define the a 

functions for '^ 3,831 62,^3 as 


■ 3,53 

The same argument as Lemma 
the Hamiltonian 


«Y3.3, = S 3 *avg ,5g, 


S i* ^ 

'^ 2,53 ^ 2,53 


5.2 


Og^.o = + ^^=1 

and that the function ol^ is a ^3 perturbation of a 

'^ 2,53 

reflection symmetries 

, {c± , C 3 ) ^G2 0 (W) ”^ 3 ) a, 


gives that the a function for 62,0 is the same as 
1 


a. 


G 2 ,( 


r, . The function has 

G2,0 G2,0 


G2,o^ WWs)— aG2,o^ 


We define the first piece of ladder by (see Figure [db]) 

(C±,C3) = E, 


^ 3 ( 03 ) := {(c±,C3) I 


2.■53 


|C3| < IC 3 I, C_L < 0} 


for the system 62 , 5 , which transforms to a piece of ladder (see Figure 4a) for the 
system 02,83 S^ Moreover, since we have taken quotient of cy, in system ^ 3,83 
before the reduction, this ladder L 3 (c 3 ) is embedded in a piece of two dimensional 
surface which is a product of L 3 (c 3 ) with intervals. We use the same notation L 3 (c 3 ) 
to denote the ladder for all the systems Ts^^g, 02,83 ^ 2,53 without introducing new 

notations. 
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The next lemma shows how to move the third component of the cohomology class 
along the ladder. 


Lemma 6.2. Under generic perturbation in the system (6.7), each Aubry set 
A{c) for c G 11^3(03) can be connected to another Aubry set A{c') nearby if c' G L3(c3). 


Proof. The system 13,(53 of three degrees of freedom, when restricted on its NHIM, 
becomes a system 6 * 2,,53 of with two degrees of freedom. Both Anbry seta A{c) and 
A{c') are located in a 2-torus. We choose the covering space T x 2T x T 9 (xi, X 2 , X 3 ), 
where each Aubry set is lifted to two copies. So what we need to check is that the 
Mane set does not cover the whole T x 2T x T. 


For the reduced Hamiltonian restricted on an energy level set, since it has 

two degrees of freedom, all elementary weak KAMs can be parameterized by certain 
“volume” cr so that this family of weak KAMs is ^-Holder continuous in a G / (see 
Theorem 9.2), where / is a closed interval. We choose a subset A C / in the following 
way: u G / if and only if the weak KAM is (must be also [FSl IBe4| 1. i.e. 
the Mane set is an invariant 2-torus. For c ^ A, certain section So- of 2-torus exists 
such that AA H Fq- is shrinkable. 


Since the reduced system G 2,53 lies on a normally hyperbolic manifold, we find that 
the barrier function of the system is g-Holder continuous in cr by applying 
Theorem |9.3[ This property allows us to show the following holds for generic 63 V 3 
(perturbing V 3 at the place away from the NHIM) 


for all fj G A, each connected component 0/ Argmin{Ho-, So,o-\Um-Nm} is contained 
in certain disk Om C So,o-, 

where So,o- is a 2 -dimensional section of 3 (xi, X 2 , x^) which is transversal to 
c(cj)-semi static curves, UmAlm denotes a neighborhood of the Aubry set in the finite 
covering space, ArgminjHo-, So\Um-Nm} denotes the set of minimal points of which 
fall into the set So,cr\ Um Nm- 

The proof of this statement is the same as in the proof in Section 8.3 of |C12j . We 
can not apply the argument in [CYll [CY2] . as the perturbation < 53 V 3 is only allowed 
to depend on x. 


Therefore, the Aubry set can be connected to another one nearby, either by connect¬ 
ing orbit of type-/i when the Mane set is a 2-torus (Arnold’s mechanism of intersection 
of (un)stable “manifolds”), or by type-c when the Mane set does not cover the whole 
2-torus (Mather’s mechanism of crossing the Birkhoff instability region) (For the defi¬ 
nition of local connecting orbit of type-h with incomplete intersection as well as type-c, 
refer to Section 7.1). □ 


In the next remark, we explain our mechanism of ladder climbing. 


Remark 6.2 (The diffusion mechanism for the ladder climbing). Here we employ a 
variant of Arnold’s mechanism (1.2). Consider Hamiltonian system of three degrees 
of freedom of the form 

R = Y + Y + Y + (COSX3 - 1)(1 -h e(cosxi -k sinx2)). 


In this system, there exists diffusing orbit for each E > 0 such that {yi,y 2 ) stays 
close to the circle {yf + y 2 = 2-E'} and arctan ^ achieves any value in [0, 27r). Loosely 
speaking, {yi,y 2 ) moves along the circle {yf + y 2 = 2E}. In the example, the set 
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(b) The ladder for 62,53 


Figure 4. The ladder construction 


{ 3^3 = 7/3 = 0} is a normally hyperbolic invariant manifold and the e perturbation 


vanishes on the manifold as in Arnold’s example (1.2).This can be considered as a 


system of a priori unstable type. One can compute the Melnikov integral as in the 
Arnold’s example to verify that Arnold diffusion exists. In our case, the system 13,53 
plays the role of H here and the system 02,53 pla-ys the role of ?/i + yl • 

Remark 6.3. Since we have required |I| > 7 for some 7 < Ao/2 to avoid the singular 


behavior of the coordinates (I, ip) estimated in Lemma 3.4 Due to the convexity and 
symmetricity of the a function, the ladder never touches the forbidden region |I 
(see Figure [ 4 ^. 


< 7 


Lem ma |6.2| enables us to move C 3 from Cg to — Cg. In the next theorem, we apply 
Lemma 6.2 repeatedly combined with our reduction of order scheme to build ladders 
to move all the components of c*_3 —>■ — c*_g. 


Theorem 6.3. For the Hamiltonian of Ys (4.19) with generic ^ 2 , 1 / 3 , 
small E > 0, there exists 0 < 6 n-i 'C 


,V^_i and 


c, c 


(54 <C (53 such that following holds: 

Let C, C' be two channels connected to the flat Fq defined in (5.9). For a class 
G CnaY^{E), there is (c',c') G C naY^{E) which is connected to a class (c*,c) 
by a ladder L. For any two classes c*,c" G L with small \c* — d'\, the Aubry sets 
A{c*) and A{c") are connected by local minimal orbit either of type-h with incomplete 
interseetion or of type-c. 


Proof. We again work with the system Y 5 then transform the obtained ladder to the 
system Ys using the linear transformation induced by S. Applying Lemma [ 6 .2[ we get 
a ladder L 3 connecting two points (—c^jCg) and (—c^jCg) in system 62 , 53 . Keep it 
in mind that such a ladder is constructed for fixed c*_ 4 . 


For the construction of the ladder moving C 4 from to —c^, we recover the potential 
perturbation 64 V 4 . For generic V 4 , the reduced Hamiltonian can be further reduced 
to the following (see (4.21) with i = 2) 


Y2,5 = [^3(-^3 + 5474) + ^&4yl + 54-^1,4(^3 + S4y4, 

n—\ 

+ E 

k=5 


yyi + '5fc^i,fc(-^3,<^3,a:4,X5 


) Xfc) ] + SnR2- 


( 6 . 10 ) 
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where the bracketed term is the restriction to the NHIC in the system defined on T*T^ 


(see the bracketed term in (4.13) with j = 4) 


^ 4 , 53,54 = + V2{x) + dsVsixs) + dsY^ix^). 


By applying Lemma 6.2 again, we obtain a ladder L 4 connecting two cohomology 


classes (c_l, —Cg, ( 


, 4 , 1.5 ..., I 


;_i) and (c_l,-c 5 ,-c^,( 


■'n-D 


By induction combined with (4.21), we obtain n — 3 ladders L 3 , 
them simple ladder. 


■ ,L„_i, we call 

The composition of these simple ladders L = L„_i * • • • * L 3 


connecting two points (—c, c*) and (—c, —c*). We remark that it is enough to send 
the cohomology classes into a neighborhood in C of (—c, —c*). 

To complete the argument for the construction of the ladders, we are required to 
show the ladder survives perturbation Yl^=l+i each j = 3, • • • , n — 2. We 

use Theorem 7.2 to prove it. 


Because of upper-semi continuity of Mane set (see the proof of Corollary | 6 .1 1 for how 
to apply it to Ys), in the finite covering space T x 2T x the following condition 

holds for each c G L 3 provided 6 j <C ^3 for each j G {4, • • • ,n — 1}: each connected 
component of ArgminjBo-, (To,o-\ Um -Wn) x is contained in certain Om x 

where Om C is a 2-dimensional disk. The conditions of Theorem 7.2 are satisfied. 
So, L 3 survives perturbation. For other Lj with j G {4, • • • ,n — 2}, the proof is the 
same provided 6 j+i ^ dj. □ 


In this way we have established a transition chain to cross the m-strong resonance. 

implies 

the existence of diffusing orbit with cohomology classes changing along T^. 

In the next remark, we explain the frequency space dynamics. 


We will show in Theorem 7.2 in Section^ that the conclusion of Theorem 6.3 


Remark 6.4. Let us keep track of the frequency vector. As a varies, let us consider 
the frequency line ujt = Ai(a, ..., We set A| = 1 for simplicity and suppose 

a = a" corresponds to a complete resonance. For a —a" = A, we have Ua = + Aei. 

To cross the complete resonance amounts to move the frequency such that A crosses 
zero. We are unable to move A from positive to zero to negative or vice versa directly. 
Instead, we take a detour as follows. We first move Ua to some point where A is 
sufficiently small. We next look at the system Y ,5 in which the frequency becomes 
SAei = A(l, 0, AT,..., Af-). The c-equivalence mechanism amounts to send the first 

VO3 VOn 

two components (A, 0) —>• (—A,0) via a path similar to A(cos 0, sin 0), 9 G [0,7r]. The 
resulting frequency vector for the original system is now 


+ AS-1 (-1,0, 


S 3 


Vb3 ’ ’ Vbr^ 


= J„-as-Mi,o, 


S 3 


Vb3''"'Vbr. 


+ 2AS”^ (0,0 


S 3 


Vbs ’ ’ Vbr. 


= +2AS-Mo,o, 


S 3 


V^3 ’ ’ Vbr^ 




This error 2AS ^ (o, 0, ..., j in frequency corresponds to the vertical mis¬ 

alignment of cohomology classes in Figure Our ladder are designed to kill this error 
step by step. 
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Figure 5. The frequency space picture corresponding to Fig[^ 

The black lines oJa lies in the plane k'-*- (the black plane). The line 
intersect another line (k'-*- n k"-*-) at a strong double resonance. The 
red plane is spanjk', k"} since around the strong double resonance, the 
leading term of the potential is F((k', x), (k", x)). The red path lies in 
the red plane (c-equivalence mechanism). The blue path (the ladder) 
lies in k'-*- hence has normal hyperbolicity due to the resonance kh 

7. Variational construction of global diffusion orbits 

Global diffusion orbits are constructed shadowing a sequence of local connecting 
orbits end to end. There are two types of local connecting orbit, one is called type-h 
as which looks like a “hetroclinic” orbit, another one is called type-c as it is constructed 
by using “cohomology equivalence”. 

7.1. Local connecting orbit of type-h with incomplete intersections. For an 

Aubry set, if its stable set “intersects” its unstable set transversally, this Aubry set is 
connected to any other Aubry set nearby by local minimal orbits. It can be thought as 
a variational version of Arnold’s mechanism, the condition of geometric transversality 
is replaced by the total disconnectedness of minimal points of the barrier function. 

However, this condition is not always satisfied for the problem we encountered here. 
The stable set may intersect the unstable set on a set with nontrivial first homology, 
i.e. incomplete intersection. In this section, we design a new method to handle this 
problem. Let us first formulate a version for time-periodic dependent Lagrangian. 

Recall the definition of the function introduced in |M93j 

/i“(x, x') = liminf inf f (L{^{t),'y{t),t) — {c,'y)+a{c))dt. 

k^OO 7 (-fc)=a; J _}^ V / 

'y(k) = x' 

This function is closely related to weak KAM. Indeed, for x G Ac^i\t=o (the time-1- 
section of the Aubry class Ac,i C ^(c)) we have 

/i“(x,x') =u“i(x') (x), 

where both and uA are the time-I-section of backward and forward elementary 
weak KAM respectively (see the Appendix A.3 for details). It inspired us to introduce 
a barrier function for two Aubry classes Ac,i and Ac,j 

= u-j{x) - uA{x). 
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Passing through its minimal point there is a semi-static curve connecting these two 
classes, provided this point does not lie in the Aubry set. 

If the Aubry set contains only one class, we work in certain hnite covering space so 
that there are two classes. For example, if the configuration space is and the 

time- 1 -section of the Aubry set stays in a neighbourhood of certain lower dimensional 
torus, .4o(c) C + we introduce a covering space x x 2T. With respect 
to this covering space the Aubry set contains two classes. 

We introduce some notation and conventions. For the product space we use 

’]pi+^ = {x E . 2 ;^ _ Q y ^ j _|_ Given a set S, a point x and a 

number d, 5 -|- x denotes the translation of S by x, i.e. 5 -|- x = {x' -|- x ; x' E 5} and 
S + 5 denotes (5-neighborhood of 5, i.e. S + 6 = {x : d{x, S) < 5}. A set N is called 
neighborhood of (j, ^)-torus if it is homeomorphic to an open neighborhood of (j -|- i)- 
dimensional torus whose first homology group is generated by {cj : i = 1 , • • • ,j,j + k + 
1, • • • ,j + k+£}. Given a function B, we use Argminji?, 5} = {x E S' : B{x) = mini?} 
to denote the set of those minimal points of B which are contained in the set S. 

Theorem 7.1. For a time-periodic -Lagrangian L : x T —)• M and a first 

cohomology class c E M) we assume the conditions as follows: 

(1) the Aubry set A{c) contains two classes {Ac,i, Ac/} which lie in a neighbour¬ 
hood of {j,£) torus Ac,i\t=o C Ni and Acy\t=o C Aj/. These neighborhoods are 
separated, i.e. Ni n iVj/ = 0 ; 

( 2 ) there exist topological balls {Om C with Om H Om.' = 0 for m m', 

each connected component of 

Argmin{5e,iy, T^■+^+^W U 

is contained in certain Om x 

Then, for c' E M) satisfying following conditions 

(1) (c' - c, y) = 0 holds V y E iii(T^'+*^+^TJ+^,Z) and \c' - c| < 1; 

(2) the Aubry set A{fi) C N Li Ni/; 

there exists an orbit ( 7 , 7 ) of (jfj^ which connects A{c) to A{c') in the following sense, 
the a-limit set of ( 7 , 7 ) is contained in A{c), the u-limit set of ( 7 , 7 ) is contained in 
A{c') or vice versa. 

Remark 7.1. If £ = 0, the set Argmin{i?c,i,i', Aj/} is topologically trivial, 

it implies the stable set “intersects” the unstable set completely. Therefore, it turns 
out to be a variational version of Arnold’s mechanism. 

Remark 7.2. If the Aubry set consists of one Aubry class, we study this problem 
in certain covering space so that the Aubry set consists of two classes. The second 
condition for c can be weakened so that the result becomes sharper, but the condition 
here is easier to verify and good enough for our purpose. Because of the upper semi¬ 
continuity of Mane set in the first cohomology class, the Aubry set A{c') is also 
contained in neighborhoods of these lower dimensional tori. 

Proof. It is proved by exploiting the upper semi-continuity of Mane set with respect 
to perturbation on the Lagrangian. As A{d) C AjU Aj/, without lose of generality we 
assume A{c') n Aj/ / 0 . 
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Given a ball Om there exists small e such that Om + e does not touch other balls. 
Let Ti: M —)• [0,e] be a smooth function such that Ti{t) = 0 for t G (—oo,0] U [l,oo), 
Ti{t) > 0 for f G [0,1] and maxri = 1, Let T 2 '- M —)■ [0,1] be a smooth function such 
that T 2 {t) = 0 for t < 0 and T 2 {t) = 1 for t > 1. Let v. [0,e] so that 

v{x) = 0 if X ^ {Om + e) X and v{x) = e if x G Om x As (c' — c,g) =0 for 
each g G Z), 3 smooth function u G T'^+^+^ —M so that du = c' — c 

when it is restricted in {Om + e) x and du = 0 if x ^ {Om + 2e) x T^. 

We introduce a modified Lagrangian 

Lc,v,u{x, X, t) = L(x, X, t) — (c, x) — Ti(f)u(x) — T 2 {t){c' — c — du, x) 
and consider the minimizer 7 ^- ^+: [—k~, k~^] —)• M of the action 

|•k+ 

hc,v’X ix~,x~^) = inf / Lc^^^u{'l{t),i{t),'t)dt + k~a{c) + k'^a{c') 

l(-k-)=x- J — k- 

where x G Ac,i\t=o and x' G Ac',i'\t=o- As the Lagrangian is Tonelli, for any large 
T, the set of the curves {'yk\[-T,T] : /c"'' > T} is C^-bounded, therefore it is C^- 

compact. Let T —)■ 00 , by diagonal extraction argument, we can find a subsequence of 
7 fc. which converges C^-uniformly on each compact interval to a C^-curve 7 : M —)■ M 
which is a minimizer of on any compact interval of M. 

Let ^{Lc^^^u) denote the set of minimal curves of it follows from the above 

argument that the set ^{Lc^y^y) is non-empty. Restricted on (— oo,0] as well as on 
[ 1 , 00 ), each curve in ^(Lc,i;,n) solves the Euler-Lagrange equation for L since ri = 0 
and (c' — c — du, x) is closed. We are going to show that it also solves the equation 
for t G [0, 1]. 

If both Ti and T2 vanish, each curve in the set ^{Lc^v,u) is nothing else but a c-semi 
static curve of L. These curves produce orbits which connect Ac,i to Ac^i'- Consider 
all semi-static curves which intersect Om x at t = 0. As Om x is open, the set 
of semi-static curves is closed, 3 small > 0 such that these curve intersect Om x 
also for t G [ 0 ,f 5 ]. If we set n = 0 for t G (—oo,0] U [^ 5 , 00 ) and set T 2 = 0, these 
semi-static curves solve the Euler-Lagrange equation produced by Lc,v,u- As a matter 
of fact, along these curves the function v keeps constant when ri 7^ 0 , the term rix 
does not contribute to the equation. Clearly, the action of Lc,i;,n along these curves 
is smaller than those semi-static curves which do not pass through Om x around 
t = 0. Since is no longer time-periodic, a time-l-translation of its minimal curve 

is not necessarily minimal, i.e. 7 G does not guarantee k*'y G ^^{Lc^y^u) for 

/c G Z, where k* denotes a translation operator such that k*T{t) = T{t + k). 

Next, let us recover the term T2- Because of upper semi-continuity, the minimal 
curve of Ly^y^u must pass through Om x if d is sufficiently close to c. Again, along 
these curves, the term T2du does not contribute to the Euler-Lagrange equation, along 
these curves du = c' — c when T 2 G (0,1). 

Obviously, the orbit produced by each curve in the set ^{Ly^y^y) takes A{c) as its 
a-limit set and take A{c') its cu-limit set. □ 


The orbit ( 7 , 7 ) obtained in this theorem is locally minimal in the following sense: 
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Local minimum: There are open balls V- , Vj~ and positive integers t , t~^ such that 
V- C Ni\Aoic), V+ C N,.\Ao{d), l{-k-) E 1 /^, 7(fc+) E V+ and 

hf{x~,mo) + + /i^(mi,x+) 

/ fc+ 

Lc,v,u{d'y{t),t)dt — a{c) — kf a{c!) > 0 

-h 

I I 

k'!^ —>-oo 

i 

holds V (mo, mi) E d{V~ x Vj~), x~ € NiD 7Tx{a{d'j))t=o, x+ E iV*/ n 7 rj,(a;(d 7 ))|t=o, 
where fcE, kf E Z"*" are the sequences such that '^{—k~) —)> x~ and 'y{kf) —)> x^. 

The set of curves starting from and reaching Vf~ with time k~ + k~^ make up a 
neighborhood of the curve 7 in the space of curves. If it touches the boundary of this 
neighborhood, the action of Lc,d,u along a curve ^ will be larger than the action along 
7. The local minimality is crucial in the variational construction of global connecting 
orbits. 

Next, we formulate the theorem for autonomous Lagrangian. As the Lagrangian is 
independent of time, one angle variable plays the role of time. Given a first cohomology 
class, some coordinate system exists G~^x such that uji{p,) > 0 for each ergodic c- 
minimal measure /r if a{c) > mina, where we use a;(/r) = (a;i(/r), • • • , ujnih)) to denote 
the rotation vector of the invariant measure (see [Lx]). For this purpose, we work in a 
covering space it : M = ]Rx7r_iM, where 7r_i denotes the operation to eliminate the 
first entry, 7r_i(xi, X 2 , • • • , Xm) = {x 2 , • • • , Xm), the dimension M is for the coordinate 
xi, M = X X 2T if the Aubry set consists of only one class which stays in a 
neighbourhood of (j, I’)-torus and M = '1’-^+^+^ if the Aubry set contains two classes. 

Theorem 7.2. For the autonomous -Lagrangian L: and the first 

cohomology class c E M) we assume the conditions as follows: 

(1) > 0 holds for each ergodic c-minimal measure. 

(2) the Aubry set A{c,M) contains two classes {Ac,i, Ac^i'}, both stay in a neigh¬ 
bourhood of {j,i) torus, i.e. Ac,i C Ni, Ac^i' C A^j/. These neighborhoods are 
separated, i.e. AjniV^j/ = 0. The lift of both Ni and Ni/ to M is still connected 
and extends to xi = ±00; 

( 3 ) there exist topological disks {Om C 7r_i(Tt x T*^)} with Om H Om' = 0 for 
m A m', such that each connected component of 

Argmin{5c,i,i/, So\Aj U Ni/} 

is contained in certain {xi = 0} x Om x T^, where So = {xi = 0} x 7r_iM is 
a section of M. 

Then, for c' E M) satisfying following conditions 

(1) a(c') = a{c); 

( 2 ) (c' - c, £/) = 0 holds V 5 E and \c' - c| < 1 ; 

( 3 ) the Aubry set A{c') C Aj U Aj/; 

there exists an orbit (7,7) of 4>\ which connects A{c) to A{c') in the following sense, 
the a-limit set of (7,7) is contained in A{c), the co-limit set of (7,7) is contained in 
A{c') or vice versa. 
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Remark 7.3. For autonomous system, barrier function keeps constant along minimal 
curve. The intersection of minimal curves of autonomous system with the section Sc 
is an analogy of .4o(c) and A/o(c) for time-periodic system. 

To prove this theorem and establish an analogous inequality of 
notations and definitions. A Lagrangian L: TM —>■ M is called space-step if there exist 
Lagrangian G M), such that L~{xi, ■)\(^_ao-s) = T(a:i, •)|(_cc,- 5 ) 

and •)|(5,oo) = L{xi, •)|((S,oo) where we treat L^: TM —)> M as its natural lift to 

We assume some conditions: 

(1) > 0 for each ergodic minimal measure of respectively; 

(2) min dr- = mindr + , without losing of generality, it equals zero; 

(3) \L- -L+l < imini^i=o{^L-(w'),/3L+(w')}- 

As the minimal average action of is achieved on supp/r^ with = 0, one has 

min^^(j,)^o / L^dv > min f L^du, so the third condition makes sense. To introduce 
the concept of minimal curve for space-step Lagrangian, we define 

/i|^(mo,mi)= inf Vmo,miGM, 

7{-T)=mO 

7{T)=mi 

where ^ 

^L(7l[-r,T]) = j ^Li7{t)Ait))dt. 

To generalize semi-static curve to space-step Lagrangian, we first define a set 
of minimal curves. Because of the following lemma (Lemma 6.2 of [C12) . one can find 
the proof there) 

Lemma 7.1. If the rotation vector of each ergodic minimal measure has positive first 
component > 0 , mo 7 ^ rhi, then 

hm k^{rhQ^rhi) = 00 and ^im hi{rho,fhi) = 00 . 

Consequently, the following definition makes sense 
Definition 7.1. A curve 7 : M —)• M is in ^{L) if 

^l(7|[-t,t]) = inf /i^'(7(-r),7(T)). 

1 GM_|_ 

The set ^{L) is nonempty. Denote by jL{-,rhQ,rhi) : [—T,T] —)• M the minimizer 
such that ^l{—T) = mo, fhiT) = th-i and 

A(7l) = / L{jL{t),^L{t))dt= inf h^' 

J-T T'm+ 

Because of Lemma \7A\ this infimum is attained for finite T > 0 if mo and mi are 
two different points in M. The super-linear growth of L in i; guarantees that T —>• 00 
as —moi,mii —)■ 00 , where fhn denotes the first entry of rhi for i = 0,1. Given an 
interval [—T,T], for sufficiently large —moi,mii, the set is pre¬ 

compact in C^{[—T, T], M). Let T —)• 00 . By diagonal extraction argument, there is a 
subsequence of { 7 i;,(-, mo, m-i)} which converges C^-uniformly on any compact set to 
a C^-curve 7 : M —)> M. Obviously, 7 G ^(L), and it is shown in |C12j that 

Proposition 7.1. Some number AT > 0 exists so that \hJ^{^{—T),^(T))\ < K holds 
simultaneously for all curve 7 G ^{L) and all T > 0. 


(7.1), we need some 
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Each k € Z defines a Deck transformation : M ^ M: kx = [xi + k, X 2 , • • • , Xn)- 
Let = {x £ M : xi < —<5}, M/ = {x £ M : xi > 6}. With this notation we 
are able to define the set of pseudo connecting curve which is proved nonempty and 
upper semi-continuous with respect to the Lagrangian (see the proof of Lemma 6.3 
and Theorem 6.3 of [C12] l 

Definition 7.2 (pseudo connecting curve). A curve 7 £ ^{L) is called pseudo con¬ 
necting curve if the following holds 

AL{j\[-T,T]) = inf hl'{k~^{-T),k^j{T)) 

k-7(-T)eM“ 

k+7(T)GM+ 

for each 7 (T) G Mf and ^{T) £ . Denote by '^{L) the set of all pseudo connecting 

curves. 


Obviously, if the space-step Lagrangian L is periodic in xi, then a curve 7 G ^{L) 
if and only if its projection 7 = ttj: M —)• M is semi-static. 


Proof of Theorem In autonomous system, A{c) can be connected to .4(c') only 
if a{c) = a{d). If c, c' G a“^(mina), then A{c) n .A(c') / 0 (see |Msj 1. it is trivial 
to connect an Aubry set to itself. So, we only need to work on the energy level set 
with E > min a. Under this condition, there exists coordinate system so that 
oji (/i) > 0 holds for each ergodic minimal measure of c and d if they are close to each 
other. 


The section So separates M into two parts, the upper part M~^ extending to {xi = 
00 } and the lower part M~ connected to {xi = — 00 }. Denote a ^-neighborhood of 
So in M by So -|- d, we introduce a smooth function y G C^(M, [0,1]) such that x = 0 
if X G M“\(So -I- (5), X = 1 if £ dif~^\(Zo + 5). 

For those d such that (d — c,g} = 0 holds for each g £ Z), there 

exists smooth function n : M —>■ M so that du = d — c ii x £ {\xi\ < e} x {Om -|- e) x 
and clu = 0 if X ^ {|xi| < 2e} x {Om + 2e) x T^. 

Without lose of generality we assume A{d) n iVj/ / 0 . We consider the set of semi¬ 
static curves which generate orbits connecting the Aubry class Ac,i to another class 
Ac^i'. The lift of the curves to M intersect the section So in the set Argmin{ilc,i,i 7 ^o}- 
We pick up a connected component of this set contained in certain tubular domain 
Sii' = {xi = 0} X Om X Let jiii{t,x) denote the lift of semi-static curves 'yii'{t,x) 
so that 7 jj/( 0 ,x) = X G Sa/. As all curve in { 7 M'(t,x)} take Ac,i as their a-limit set 
and take Acy as their w-limit set, jn'lt, x) is Lipschitz in x G Argmin{i?c,i,i 7 We 
extend these curves to the whole Su/, so that 7 * 4 /(t,x) is still Lipschitz in x although 
the extended curves {jii/(t, x)} do not generate orbits of cf^ 'dx ^ Argmin{i?c^j^j/, 

By deforming So —)• S' we can assume that these curves pass transversally through 
the section S'. 


Let s = s{yiii{t,x)) denote the arc-length of the curve from 7jj/(0,x) to jii'{t,x) 
in the Euclidean metric such that s(7jj/(0,x)) = 0 and s{yiii{t,x)) > 0 if t > 0. We 
approximate the function s by a smooth function s' in the tubular domain made up by 
the curves {yii'{t,x)} with 7jj/(0,x) G Sn'. Let r: M —>• [0,1] be a smooth function so 
that r = 0 if s < 0, r = 1 if s > So and f > 0 if s G (0, so). Let w £ C^{TM, [0,1]) such 
that w = 1 when (x, x) is restricted in {{jn'it, x),yiii{t, x)) : x G Su', s £ [0, So]}+5 and 
w = 0 when (x, x) does not lie in the set {{yu'it, x),yii>{t, x)) : x G Su', s £ [0, so]}+2(5. 
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Next, we are going to show the curves in ’^{Lc^v,u) produce orbits of (j)\^ connecting 
A{c) to A{c!), where the modified Lagrangian is defined as follows 

(7.2) = L - {c,x) - w{d{T o s'),x) - x{c' - c- du,x) + a(c), 

where x = 0 if x G M“\(So + <5), x = 1 if 3 : G M~^\{T,o + 6), the function s' is extended 
to the whole space in any way one likes because vTa^supprc is contained in the tubular 
domain where s' is well-defined, tTx- TM ^ M denotes the standard projection along 
tangent fibers. As the first step, let us set x = 0- Because of the upper semi-continuity 
of L —)• '^(L), each curve in ^{Lc,v,u) either is contained in {( 7 jj/(t, x),^iii{t, x)) : x € 
'S'm'} -|- <5 or keeps away from the larger tubular domain x), x)) : x G 

Sii>} + 25. As rc = 1 holds on the smaller tubular domain, the term {d{T o s'),x) 
does not contribute to the Euler-Lagrange equation. By definition, along each orbit 
(7jj/(t, x), 7jj/(t, x))|sg[o,so] one has (5(ros'), x) >0. Therefore, in the lift of { 7 jj/(t, x)) : 
X G S'**'}, only those curves are the member of ^(Tc,i;,n) for x = 0 if they pass through 
the section Sn'. Other curves in the lift are not in because they have larger 

Lagrange action. 

Next, we recover the term x{c' — c — du, x) which is C^-small. Due to the upper 
semi-continuity again, all curves in niust pass through Su/ if they connect 

Ac,i to Ac\i'. As du = c' — c \i X G {|xi| < e} x {Om -I- e) x and = 0 if 
X ^ {|a^i| < 2e} X {Om + 2e) x T^, one can see from the definition of x that the term 
x(c' — c — du, x) does not contribute to the Euler-Lagrange equation. It implies these 
curves produce orbits of (j)\ which connects A{c) to A{c'). □ 


The orbit ( 7 , 7 ) obtained here is local minimal in following sense (analogous to 7.1): 


Local minimum: there exist two {n — 1) dimensional disks C M and positive 

numbers T,d > 0 such that TtV~ C Ni\A{c), C Nii\A{c'), 7 transversally passes 
7 rV)“ and at the time —T and T respectively, and 


hf{x ,7rmo) -h ,, ,^(mo, mi) /i“( 7 rmi,x+) 

At _ , 

- hm / _Lc,v,ui 7 it),jit))dt-{t- +tf)a{c)>0 


holds V (mo, fh\,T') G d{V~ x 17+ x [T — d, T +d]), x~ G Nimrx{a{d'y)) and x+ G iVj/n 
TTx{uj{d'y)). Where —>■ 00 and tf ^ 00 are the sequences such that 'y{—t~) —> x~ 
and 7(t)^) —>■ x+. 


7.2. Local connecting orbits of type-c. Eor autonomous system, if d is equivalent 
to c with \c — d\ <C 1, then {d — c,g) =0 holds for all g G Hi{N{c) n Sc,Z) where Sc 
is a section of M. So there is a function u defined on the whole torus and du = d — c 
holds in a small neighborhood of M{c) H Sc. To connect A{c) to A{d), we work in a 
coordinate system G~^x so that wi(/ic) > 0 holds for each ergdic c-minimal measure. 
The new coordinate system Gc is chosen so that the lift Sc to the covering manifold 
M contains infinitely many compact connected components. We fix one component, 
denoted by Sj). Other components in the lift of Sc are obtained by translating this 
one by 2kTT in the direction of xi. The section Sj? separates M into two parts M~ and 
M+. In M+, the coordinate xi can be extended to ± 00 . Let sign be a sign function 
dehned as sign(x) = ±1 if x G M+. 
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Let Lc^u be a space-step Lagrangian defined on the covering manifold M 

(7.4) = L — {c, x) — x{c' — c — du, x) + a{c) 

where a{c) = a(c'), x = 0 if x G M~\{'Eo -|- <5), x = 1 if £ A7+\(So -|- 5). Obviously, 
for c' = c, we have Tt^{Lc,u) = A/’(c). According to the upper semi-continuity, for 
sufficiently small |c^ — c|, the image of each curve 7 G ‘^{Lc^u) falls in a small neigh¬ 
borhood of Af{c). Therefore, {d — c — du, x) = 0 holds along this curve when it passes 
through a small neighborhood of It implies that the term xW — c — du,x) does 
not contribute to the Euler-Lagrange equation determined by L. Therefore, this curve 
also solves the Euler-Lagrange equation for L. Clearly, approaches the 

Aubry set for class c' as t —)• 00 . Therefore, we have 

Theorem 7.3 (Theorem 6.4 of [C12j : connecting orbits of type-c). Assume the coho¬ 
mology class c* is c-equivalent to the class c' through the path T; [ 0 , 1 ] —)■ iL^(T”,]R). 
For each s G [0,1], the following are assumed: 

(1) there exists a coordinate systems Gj^x where the first component of rotation 
vector is positive, wi(/.ir(s)) > 0 for each ergodic T{s)-minimal measure ^r(s)/ 

( 2 ) for the covering space Mg = M x in the coordinate system the lift of 

non-degenerately embedded codimension-one torus Er(s) has infinitely many 
connected and compact components, each of which is also a codimension-one 
torus. 


Then there exist some classes c* = co,ci, ■■■ ,Ck = c' on this path, closed 1-forms rji 
and fii on M with [rji] = Ci and [fii] = q+i — q , and smooth functions Qi on M for 
i = 0,1, ■ ■ ■ ,k — l, such that the pseudo connecting curve set ^{Li) for the space-step 
Lagrangian 

Lci,ui = L - {a, x) - Xi{ci+i - Ci- dui, x) + a{ci) 
possesses the properties: 


(i) each curve 7 G ^{Li) determines an orbit (7,7) of (jf^; 

(ii) the orbit (7,7) connects A{ci) to A(cj+i), i.e., the a-limit set a{dx) F A{ci) 
and iv-limit set u:{d'y) C A(ci+i). 


Corollary 

Theorem 


7.1 (Corollary 6.1 of [C12) L Let 


Ci+\; 
^0 


Xi and Ui be defined as in 


7.3. Let Ui be a neighborhood of N{ci) n Tfi.. such that Cj+i — Ci — dui\ui — b- 

> 0 and small d > 0 such that for each fh, fh' G M, 

{fh, in') reaches 


Then, there exist large Ki > 0, Ti 
with —Ki < rfii < —Ki 2tt, Ki — 2tt < m'^ < Ki, the quantity hj^^ 


its minimum at some T < Ti and the corresponding minimizer 7 j(f, m, m') satisfies 
the condition 


(7.5) 


Image(7j) C (S° d) C U. 


There is some flexibility to choose the coordinate system and the non-degenerately 
embedded codimension one torus. Let 'Kg'- Ms —)• M = T” be a covering space such 
that Mg = M X in the coordinate system Gf^x. 

Definition 7.3 (admissible toral section). For s G [0,1], the non-degenerately em¬ 
bedded codimension one torus is called admissible for the coordinate system Gf^x 
if the lift of Ss to the covering space Mg consists of infinitely many connected and 
compact components, the first component of the rotation vector is positive a;i(/rr(s)) 
for each ergodic T(s)-minimal measure. 
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7.3. Existence of generalized transition chain. Roughly speaking, a generalized 
transition chain is such a path F: [ 0 , 1 ] —iF^(M,M) that, for any s,s' G [ 0 , 1 ] with 
|s — s'l <C 1, the Aubry sets A.(r(s)) and A.(r(s')) are connected by local connecting 
orbit either of type-h or of type-c. An orbit ( 7 , 7 ) of the Euler-Lagrange flow is 
said connecting two Aubry sets if the a-limit set of the orbit is contained in one Aubry 
set and the w-limit set is contained in another one. It is called local connecting orbit 
if the first cohomology classes are close to each other. 

According to a result of Bernard mu, both Aubry set and Mane set are symplectic 
invariant. Therefore, from what have been done in previous sections one obtains clear 
understanding of the Aubry and Mane sets along the resonant path. It is good enough 
for us to construct generalized transition chain. 

Let us formulate the definition for autonomous Hamiltonian H : T*M —)• M where 
M = T” with n > 3. 

Definition 7.4 (Generalized transition chain: the autonomous case). Two cohomolgy 
classes c,c' G M) are said to be joined by a generalized transition chain if there 

exists apathT: [0,1] —)• L7^(M,M) such thatT{0) = c, r(l) = c', a:(r(s)) = E > mina 
and for each s G [0,1] at least one of the following cases takes place: 

(HI) In some finite covering manifold: tt : M ^ M the Aubry set A(r(s)) consists 
of two classes. There are two open domains Ni and Ni/ with Ni n iVj/ = 0, a 
decomposition M = Mi x [n — £ — 1)-dimensional disks {Om C 7 r_iMi} 
with Om n Om' = 0 ? an (n — 1)-dimensional disk Dg and two small numbers 
Ss,Sg> 0 such that 

(i) the Aubry set A(r(s)) n A^i 7^ 0, A(r(s)) n A^2 / 0 and A(r(s')) n (A^i U 
N 2 ) / 0 for each [s' — s| < 5s, 

(ii) 7 rA/’(r(s), M)|D^\(A(r(s)) + <5') is non-empty, of which each connected 
component is contained in Om x 

(hi) (r(s') — T{s),g) = 0 holds for each g G Hi{M, Mi, Z); 

(H2) For each s' € (s — 5s, s + 5s), r(s') is equivalent to r(s). Some section and 
some small neighborhood U ofJ\f{T{s))ri'Es exist such that {T{s') — T{s),g) = 0 
holds for each g G Hi{U,'L). Each class r(s') is associated with an admissible 
section S../ for the coordinate system Gf^x. 


Let us explain a bit more about the definition. In the case (HI), if the Aubry set 
contains only one Aubry class, one can take some finite covering vr : M —>■ M so that 
A{c, M) contains two classes. A typical case is that A(r(s)) is contained in a small 
neighborhood of lower dimensional torus. If A(r(s)) contains more than one class, we 
choose M = M. The (n — I)-dimensional disk Dg is chosen to intersect transversally 
all r(s)-semi static curves. If £ = 0, there is no restriction like (iii) on the class r(s'), 
it turns out to be trivial. 


Once such transition chain exists, one can construct diffusion orbits by variational 

To see the 


method, which is almost the same as in |CI2j . recapitulated in Section 7.4 


existence of such transition chain in the system (I.I), let us recall that, after (n — 3) 
steps of approximation of frequency, a path T*^: [ 0 , 1 ] —)• iLi(T”,]R) is fixed to pass 
through small neighborhood of - ■ ■ 

finite covering of open segments of curves 


uj^ in any prescribed order, which admits 


where T)^ ^ connects T)^ ■ 


r 

to T' 


C 


lJ(C,,ur)),ur-,urr,,) 


^fr,i connects T^ - to ■ and 
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(1) j is obtained by reduction for single resonance. For each frequency w G F^ ^ 

but finitely many the Mather set Ad(c) consists of only one hyperbolic 

periodic orbit where c G for those uj^, the Mather set consist 

of exactly two hyperbolic periodic orbits. All of these orbits share the same 
first homology class, i.e. certain gi G exists such that [ 7 ^^] = gi 

holds for each u G F^^j. These periodic orbits make up finitely many pieces of 
normally hyperbolic cylinder. Let Cj = it is an open and connected 

channel in 

(2) The segments F^^, Tf^ and F^^ are pretty short, lie in a small neighborhood 

of m-strong resonant frequency ujs,i- Their Fenchel-Legendre dual J, 

Contained in the annulus Aj surrounding the flat 
Fj = The annulus Aj admits a foliation of circles of cohomology 

equivalence, one is chosen as the chain of c-equivalence, in •) (z = I, r) 

one curve is chosen as the ladder. For c G (UF^ jF^^ U F^ •), the structure 
of Mather set A{c) may not be regular, could be complicated. 

Therefore, we are able to construct a generalized transition chain F: [0,1] —)• (T*^, M) 

F C U Tii^i U Fs,i U Tir^i) 

passing through small neighborhood of ''' ? where Tm,i C 

ijn = w,s,l), and 

( 1 ) F^^j passing through the interior of Cj, connected to Fs^, by the ladder F;; j 
and connected to Fg-i^j by the ladder F;^ j_i; 

( 2 ) Fs,i turns around Fj, connected to and to Cj+i, by the ladder F;;^j and F;^,^ 
respectively. 


Once such a generalized transition chain is constructed, we obtain immediately the 
main result of this paper (Theorem 1.1) by applying Theorem 7.4 in the following. To 
understand why the constructed orbit passes through the small ball Bs{pi) let us note 
following facts 


( 1 ) if the system is integrable, the relation p = c holds along each c-minimal orbit, 
there is an one to one correspondence between action variable p and frequency 
a; —)> p = dh~^{uj). For an integrable system under small perturbation of order 
0(e), it holds for each frequency ui G M"' that the size of ^p^{uj) is at most 
of order 0{^/e). Along any orbit in the Mane set A4(c), the action variable is 
restricted in the 0 (-y/e)-ball centered at c if we treat c and p as points in M”. 

( 2 ) keep it in mind that w* is chosen close to dh{p^) and the transition chain passes 
through Jf^^(a;*). From the construction of diffusion orbits, one can see that 
there is a quite long period when the orbit keeps very close to certain orbit 
in the Mane set Al(c) with c G .if^^(a;*). It implies the orbit passing through 
neighborhood of p*. 


7.4. Variational construction of globally connecting orbits. It was done in 
[Cl 2) for general dimension n. Here we only recapitulate the main steps in the proof 
and refer readers to |C12) for details. 

Theorem 7.4 (Existence of globally connecting orbit). If c is connected to d by a 
generalized transition chain F as in Definition\7.4\ then 
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(1) there exists an orbit of the Lagrange flow ( 7 , 7 )/ M —)• TM which connects 
the Aubry set A{c) to A{c'), namely, a^d'y) C A{c) and oj{d'y) C A{c'); 

( 2 ) for any ci, C 2 , • • • , G F and arbitrarily small 6 > 0, there exists time-sequence 
ti < t 2 < ■ ■ ■ < tk such that the orbit ( 7 , 7 ) 0 /^^ passes through 5-neighborhood 
of the Aubry set A{ci) at the time t = ti. 


Sketch of the proof. The proof of this theorem is the same as the proof of Theorem 7.1 
of |C12j . We only sketch the idea of the proof here, readers can refer to |C12j for the 
details. Because of the condition of generalized transition chain, there is a sequence 
0 = So < Si < • • • < Sfc = 1 such that for each 0 < j < k, ^(r(sj)) is connected to 
^(r(sj+i)) by local minimal orbit either of type-h with incomplete intersection or of 
type-c. The global connecting orbits are constructed shadowing such a sequence of 
orbits. 

Recall the construction of local connecting orbit as above, for each i G {0,1, • • • ,k} 
let r]i{x,x) = {ci,x) and 


pii{x,x) = Wi{d{Ti o s'f),x), ifi{x,x) = Xi{ci+i - Ci- dui,x) 


in certain coordinate system G- ^x (see (7.2), (7.4) for the definition), if it is for type-c, 
we set Hi = 0 . For each integer k we introduce a translation operator on functions 
k*f { xi , X2 , ■■■ ,Xn) = f{xi - /c, X2, • • • , X„). 


Let tt: M"" —)• M be the universal covering space. For a curve 7 : [—K,K'] —)• M”, 
let 7 = Try: [-K,K'] M. Let t = (tg , • • • ,t^_^,t'l), x = (xg , • • • 
with fl <t~ < tf_^i, tg = —K and t'^ = K'. we consider the minimal action 


(m, = inf ^ / {L - r]i){dy~{t))dt 

i=o 

^“1 ptf ^ 

+ ^ /_ {L-r]i-{kiGi)*{Hi + ipi)){dy:^{t))dt 


2=0 

where the infimum is taken over all absolutely continuous curves 7 : [—K, K'] —)■ M”’ 
satisfying the boundary conditions y~{t~) = yi^{t~) = x~, yi^{tf_^i) = for 

i = 0,1, • • • ,ik-i, l{—K) = cn, y{K') = m'. By carefully setting boundary condition 
we find that the minimizer is smooth everywhere, along which the term {kiGi)* {ni+fl'i) 
does not contribute to the Euler-Lagrange equation. It is guaranteed by the local min¬ 


imality of (7.5) as well as (7.3) and setting the translation fej+i — ki sufficiently large. 


The condition of incomplete intersection does not cause new difficulty in verifying the 
smoothness of the minimizer. Therefore, the minimizer produces an orbit ( 7 , 7 ) of 
which has the properties stated in the theorem. □ 


8. Genericity and the proof of the main theorem 

In this section, we first describe the generic set of perturbations in the C^-function 
space, for each eP in this set the Hamiltonian h{y) -|- eP{x, y) admits diffusing orbits. 
This set has a multiple filtration cusp-residual structure. After that we complete the 
proof of the main theorem. 
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8.1. Generic conditions: the multi-filtered cusp-residual condition (MFCR). 

The construction of diffusion orbits requires a list of conditions on perturbations, we 
shall verify them one by one in this section. There are mainly two types of conditions 
of genericity. The first type is called generic condition on hyperbolicity (HGC) since 
they are used to guarantee normal hyperbolicity of NHlCs. The second type is called 
generic conditions on transversality (TGC), since they are used to guarantee certain 
“transversal” intersection of the stable and unstable sets of the Aubry sets. 

Essentially, the genericity problem is reduced to the following two parametric 
transversality (PT) results, PTl for HGC and PT2 for TGC. 

(PTl) Consider a function T’^(x), M —)■ M, M compact manifold. Suppose F is 
C*”, r > 4 in X and Lipschitz in G [CoiCi]; then for V{x) in an open and 
dense subset of C^{M), the global max of T^(x) + V{x) is non degenerate 
uniformly for all ^ except for finitely many ^j’s, the function Fq.{x) has two 
nondegnereate global maxima. 

(PT2) Consider a function T’^(x), M —)• M. Suppose F is Lipschitz in x and a Holder 
in C G [Co, Cl]) then for generic P(x) G C^{M), r > 0, the function T^(x) + P(x) 
is non constant for all C- 

PTl is proved in [CZllICZ^ and used to establish the genericity of (Hl)^- and (H2)^-, 
j = 1, 2 ..., n — 2. PT2 is proved in |CYll [CY2] . 


8.1.1. Generic condition on hyperbolicity {HGC) and multi-filtered generic set. We 
have two types of HGCs: (HI) ■ and (H2) •, j = 1, 2 ..., n — 2 (see Section 


4.1 


and 

4.2.2). The HGC of the j = 1 case is in Section 3.2 and 3.4 (see (HI), (H2.1), (H2.2), 
Theorem 3.4 and Lemma 3.2 we do not include analogues of (H2.1) and (H2.2) in 
(H2)j since the spectral gap is easily guaranteed in the following reduction of order 
and will not affect the multiple-filtration structure). The HGC of the j = 2 case is in 
Section 3.11| (see (Hl) 2 , (H2)2 there). 


Since we have finitely many such HGCs, each of which is shown to be generic, we 
get a residual set in c^{T*T'^) consists of P{x,y) satisfying all the above HGCs. 
We point out that such a residual set also exists in C'”(T’^) (Mane genericity) because 
in this case (Hl)j is trivial and (H2)j is always shown to be generic in C'”(T"') as 
stated. In the following, we shall only talk about c^{j'*Y^) keeping in mind that all 
the arguments hold also for 


The generic set iU has a multi-filtered structure, namely, there is a filtration = 
with C and each admits a new filtration 

C 54?^ so on. After n—2 levels of filtration, we have TJ = 

Such a multiple filtration can be defined for any set if triviality is allowed, e.g. = QJ 
for all ii in the first level of filtration. To avoid such triviality, we require the inclu¬ 
sion C to be proper for all ij large enough and for all 

j = 1,2,... ,n — 2. The multi-filtration has a natural lexicographical partial order. 


•n—2 


given by 
j = 1,2 ,... ,n - 2. 


c , 


if (H,..., in- 2 ) 4 (i'l, • • •, in- 2 )^ i-e- ij < i'j for all 


We denote by the set of functions in C'^{j'*T'^) satisfying (HI), (H2.1), (H2.2), 

i.e. HGC for the first step of reduction of order. Notice 


Theorem 3.4 and Lemma 13. 2 


that these conditions are only imposed on Fourier modes of P{x, y) in spangjk'} 
or spanz{k',k"} where (k',a;a) = 0 for all a and {k”,cOa) = 0 for finitely many 
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a’s. We next decompose 03^ = with C where is the set of 

P{x,y) G satisfying 


—d‘^iZ'{x 2 -,y) > l/if in (HI) in Section 


A 2 > Ai + 1/ii in (H2.1) of Section 3.4 


3.2 


The normal Lyapunov exponents of the periodic orbits given by Theorem 3.4 
and Lemma 3.2 have absolute values greater than 1/ii. 


For P{x,y) G we get a NHIC of dimension 2n — 2 whose normal hyperbolicity is 
quantified by 1/ii. 


This is the first level of filtration. We next describe the second level of filtration. For 
each there is an associated > 0 (A1^ = 0 iff ifj^^ = 0 ) such that for all k G Z”" 
with Ik 


< 




we have that the NHIM Theorem 


3.1 


and 


3.2 


are applicable to 


1 


Here both k'^ and depends on ii through Al^. 

so that functions in satisfy (HI 

0 • 

* 1,*2 


show the persistence of NHIC when the perturbation is (5Z2((k', x), (k",x), (k, x),y) 
which consists of those Fourier modes in P{x,y) in span^jk', k", k}\spanz{k', k"}. 
We choose one such k denoted by k'^ which is perpendicular to in (3.38) and (4.4). 

For this k'^, we introduce a residual 
,, (H2)2. EachTJ,^ 

satisfying further 


setTJ)^ 


filtration := 


2, Each admits a further 

where if? ,• consists of functions in TJ? 

61,62 61 


• The eigenvalues of the matrix —HessZ^ in (Hl )2 are greater than l/i^- 

• The normal Lyapunov exponents of the periodic orbits given by (H2)2 have 
absolute values greater than 1 /^ 2 . 


Perturbation P G il?^ admits a NHIC of dimension 2(n —2) whose normal Lyapunov 
exponents quantified by 1/^2. As before, for each if?^ there is an associated number 
A^^ i 2 — *2 ~ 12 ~ which gives the upper bound for |k'^|“^ where k'^ 

is perpendicular to in (4.4) such that Eourier modes containing (k'^,x) does not 
destroy the NHIC when applying the NHIM Theorem 3.1 and |3.2[ Once this k'^ is 
chosen, we apply (Hl )3 and (H 2 ) 3 . 


We repeat this procedure for n —2 steps until all the HGCs (Hl)^-,(H2)j- are applied 
to get a residual set TJ = Uq<,..<i„_ 2 Tl)^~^ j _ in that admits (n — 2) levels 


of filtration structure. Eor each P G 23, the system (1.1) admits NHIC of dimension 
four (for the flow and two for the Poincare map). See Theorem 5.1 


The term “cusp residual” can be understood as follows. On the one hand, for 
fixed e, we cannot consider all P G 23 since there is an e remainder (the Rjj term) 
in the Hamiltonian which may destroy the NHIC if the normal hyperbolicity is too 
weak. Hence for each fixed e, we consider only P G 23”~^ for some labeling of 
(p,... ,in- 2 ), which is not residual in since its complement contains open 

sets. We call such a set 23f~^ ,• a cusp set. On the other hand, for each P G 23, there 
is ep sufficiently small such that the NHIC persists the e remainder for all e < ep. 
However, this ep cannot be uniformly bounded away from 0 for all P G 23. 


8.1.2. The transversality genericity condition (TGC) in the presence of NHIC. Eor 
each P G 23, the MECR set, the HGCs are all satisfied. We can find e small enough 
such that NHIC exists in a large part of the phase space applying Theorem 5.1 The 


NHIC containing all the Aubry sets A{c) with c G .if^(a;a) in the part of phase space 
7 away from the strong double resonance, where ut = A|(a, j • j is in (4.8). 


Qn ' 
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In the homogenized system, the lengths of NHIC that we get are of order 0{e 
From now on, we consider £ as fixed. 

In the presence of NHIC, the problem of constructing diffusing orbit is well under¬ 
stood as a priori unstable systems in |CY11 ICY2| . We recapitulate the main ideas of 
the genericity proof of TGC in |CYll [CY2] . 

Each Aubry set .4(c), c G carries stable and unstable “manifolds” whose 

graphs are {x,dxuf -|- c) in terms of weak KAM solutions u^. The construction of 
diffusing orbit requires either (HI) or (H2) in Dehnition |7.4| holds. (H2) holds when 
A{c) is not an invariant curve, so it remains to satisfy (HI) when for all A{c) being 
curves. The intersection of the stable and unstable manifold are given by du~ — 
du'^ = 0 and the projected Mane set AA(c) = ArgminHAx) where the barrier function 
Bc{x) = u~ — u'^. So to satisfy (H2)(ii) in Definition 7.4, the problem is reduced to 


showing that Bc{x) is non constant uniformly for all c with A{c) being an invariant 
curve. The parametric transversality result PT2 applies if we can show that the 
family of functions Bc{x) can be parametrized by certain parameter with respect to 
which the function is Holder. It is shown in [CY1[[CY2| that B„(x) parametrized by a 
parameter cr is 1/2-Holder, i.e. —B^^Wco ^ C\a\— for cri,cr 2 corresponding 
to invariant curves. In our setting this estimate holds in the homogenized system and 
the constant C becomes in the original scale.Since e is now a constant and only 

the Holder exponent is relevant in the proof of PT2, we conclude by applying the 
argument of |CYll ICY2| that 

there exists Eq such that for each e < Eq, 


for each cusp set G C”(r*T 

there exists a residual set • 

(1.1) with perturbation in P (z 


C QJ: 


n—2 

ii,... 


such that the Hamiltonian system 


by Theorem 5.1 




satisfies Definition l.f along the NHIC given 


It is shown in [C12| IC15b| that the same statement holds if the function space 
C”(r*T’^) is replaced by C”(T"). 


8.1.3. TGC for cohomology equivalence. It is a condition for V 2 {x) in (4.19) if we 
encounter the l®*-strong resonance. For generic V2, it holds for each c G SFq that the 
Mane set of the system ^(Ay, y) -|- V 2 does not cover the 2-torus: 7 rAA(c, 0) ^ 9 

{xi,X 2 ). It was shown in Theorem |6.1| (see also Theorem 5.1 of |C12) i. 


8.1.4. TGC for ladder construction. A ladder L is constructed connecting a circle of 
cohomology equivalence to an s-resonant channel. This ladder is composed of m — 2 
simple ladders if we are encountered m-strong resonance 

L = Li * Tfc+i * • • • * Lm—2- 

Each Lfc is constructed as a transition chain, i.e. any two Aubry sets .4(c), .4(c') are 
connected by local minimal orbit of type-h if c, c' G are close to each other. As 
explained in Remark 6.2, this is an autonomous version of Arnold mechanism. Hence 


the construction of the diffusing orbit is similar to the a priori unstable case and the 
genericity of (TGC) is also similar except that we need to use a new regularity result 
for weak KAM solutions. 


We need to check the conditions for twice. First, as the potential in (4.19) takes 


the form <5*14 (xi, 

YA=2^iyi{xx, - ■ ■ ,Xi). 


, Xi). Second, when the potential eventually takes the form 
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At the beginning of constructing L^, we have a reduced system with (n — tj 
degrees of freedom, where the last n — l — 2 coordinates are cyclic, namely, the 
Hamiltonian under consideration is independent of (x^+ 3 , • • • , Xn-i)- It allows 
us to treat this system as two degrees of freedom so that theorem of regularity 
properties of weak KAMs of system with two degrees of freedom (Theorem 


9.2) can be applied. As the reduced system lies on normally hyperbolic mani¬ 


fold, such regularity of elementary weak KAMs can be extended to the whole 
space (Theorem |9.3[ ). Using techniques developed in Section 8.3 of |C12| (see 
Theorem 8.1 there), one can show it is a generic requirement for Vi that each 
connected component of the set of minimal points of Be 


Argmin{Hc, Sc PI (T"- ^\M(c) e)} 


is contained in Om x where Sc = Sc ,3 x is codimension one 

section. Sc ,3 is a section in which is the configuration space for the first 
three coordinates, Om C Sc ,3 is a 2-dimensional disk. 

• After the construction of all simple ladders, Li, • • • ,Lm- 2 ) we need to check 
whether Li, • • • ,Lm -3 can survive small perturbation. For L^, the extra per¬ 
turbation takes the form of '' ‘ j®*)- upper-semi 

continuity of Mane set in perturbation, it holds V c G that each connected 
component of 


Argmin{Hc, Sc P (T"- \A{c) -F e)} 


is still contained in Om x provided 5* <C for i G {£ -|- 1, • • • , m — 2}. 

As a matter of fact, certain small constants d > 0, D{Om, d) > 0 exists such 
that 

min Br — min Be > D(Om,d). 

d{Om+d)xT^-W Sc 

If I • ^Xi)\ < ^i{D{Om,d)) with sufficiently small aJ > 0, 

the property as above persists. 


8.1.5. The MFCR set in the main theorem. We get a residual set QJ by imposing the 
HGC and further a set residual in QJ (hence also residual in (or C'^(T"'))) by 

taking union over all the sets 91”“^ ,• such that both HGC and TGC are satisfied. 
Perturbations chosen in the last residual set have Definition 17.41 satisfied hence admit 
diffusing orbits. We then apply the Kuratowski-Ulam theorem to get a residual set 
on the unit sphere ©1 and for each P G 91 a number ep and a set Rp residual in [0, ep] 
such that {AP : P G 91, A G Rp} satisfy both HGC and TGC hence admit diffusing 
orbits. This gives the cusp-residual set in the statement of the main theorem. 


8.2. Proof of the main theorem. Given any two p-balls as stated in the theorem in 
the space of action variables, we find a p-accessible path (c.f. Definition 4.2) according 
to Theorem 4.1 It is enough to show how to build the generalized transition chain 


(c.f. Definition 7.4) along such a p-accessible path. We show in Section 7.4 that such 
a generalized transition chain gives rise to an orbit of the Hamiltonian system visiting 
the Aubry sets associated to the cohomology classes along the transition train. 

Since the p-accessible path is piecewise linear, we distinguish two different cases: 


1 , along each admissible line segment, 

2 , transition from one line to another. 
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Here we replace A(k''^) in the item (2) in Definition 4.1 of the admissible line segment 


by min{ A (), Aj }, where A 


is given in the second bullet point of Section 


1.1.4 


With this change in the definition of admissible frequency line segment, the proof o 
Theorem 4.1 goes through. 

For the case 1, along each admissible line segment, we have further two cases 


1.1, Single resonance and weak double resonance, 

1.2, Strong double resonance appearing at the j-th step of the reduction of order. 


For the case 1.1, we have a complete understanding of the structure of the Aubry 
This case is the same as systems of a priori unstable type studied 


set in Theorem 5.1 


in [CYli ICY2| . The generalized transition chain is built by applying Theorem 7.2 
with i = 0 (complete transversal intersection). 

For the case 1.2, the transition chain is built step by step as we did in Section and 
We consider the case j = 0, i.e. we encounter the strong double resonance before 


6.2 


any reduction of order, and the cases with j > 0 are done similarly. The method is to 


apply the c-equivalence in Lemma [6.1| in Section 6.2 to move the first two components 
of the cohomology classes. Next, we perform the scheme of ladder climbing in Section 
6.2 after each reduction of order. Here after j-th reduction of order, we have a system 


of n — J degrees of freedom. In this case, we apply Theorem 7.2 with i = n — j — 2 
being the dimensions whose transversal intersection of the stable and unstable “sets” 
of the Aubry sets are not needed. This completes the construction of the generalized 
transition chain for case 1. 

Next we consider the case 2. Consider frequency vector of the form (a, 

According to the construction in Theorem 4.1, when we stop moving a and start mov¬ 
ing the second component denoted by b, we set a to be a rational number that is very 
close to a Diophantine number \a — ujl^\ < /in- 2 - We next perform the procedure 
of reduction of order forn—3 steps to arrive at a frequency vector of the form (a, b,^), 
where < /in-Sj associated to a Hamiltonian system of three degrees of free¬ 

dom. Since all the three components of the frequency vector is rational, we are in the 
case of double resonance for a Hamiltonian system of three degrees of freedom. Sup¬ 
pose a/(pn/gn) := ^ and 6/(pn/g'n) := ^ with g.c.d.(pa, Q'a) = 1 and g.c.d.(pa, ^a) = 1 , 
and we consider resonance vectors of the form k^ := {qa,0, —pa), k;, := {0, qb, —ph). 
The frequency line obtained by varying a (respectively b) lies in the kernel of k^, (re¬ 
spectively kfl). Applying Proposition 2.2 for double resonances and perform linear 


symplectic transformations, we reduce the Hamiltonian system into the form of G in 
Lemma l3.ll with a remainder that can be chosen to be as small as we wish. We next 


apply the method of c-equivalence in Lemma 6.1 in Section 6.2 to build transition 
chain to transit between two frequency lines. 

For finitely many /o-balls in a~^{E), a transition chain can be constructed similarly 
to pass through these balls. It gives rise to an orbit of the Hamiltonian system visiting 
the Aubry sets associated to the cohomology classes along the transition chain, namely, 
the constructed orbits visit these balls Bp{yi) in any prescribed order. The smoothness 
requirement r > 2n is given in Remark |4.1[ This completes the proof of the main result 


(Theorem 1.1) 
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9. PARAMETRIZATION of weak KAM solutions AND ITS MODULUS OF 

CONTINUITY 

In this section, we show that the weak KAM solutions on a level set of a Tonnelli 
system of two degrees of freedom can be parametrized by a parameter such that they 
have Holder modulus of continuity. This is a substitute of the Holder regularity result 
of |CYlj to show the genericity of the perturbation P needed for the of construction 
of ladders in Section 16.21 

9.1. Topology of level set of weak KAM solutions. The modulus of continuity 
of weak KAM solution is obtained for some “volume” parameter. The introduction 
of this parameter relies on the well-understanding of the topology of level set of weak 
KAM solutions. Denote the level set of U — U' by 

Zu,u' = {x G : U{x) - U'{x) = 0}, 

and let 

= {x E : U{x) > U'{x)]. 

We let Uc = Uc + (c, x) and drop the subscript c when it is clearly indicated, where 
Uc is a globally elementary weak KAM solution for certain Aubry class in A{c) (see 
Appendix |A.3| for its definition). Obviously, Uc is a viscosity solution of the Hamilton- 
Jacobi equation 

H(x, dxu) = a(c), X E M"". 

Global viscosity solutions exist provided a is not smaller than so-called critical value, 
i.e. the minimum of the a function. 

Theorem 9.1. Let U = u + {c,x), U' = u' -\- (c',x) he globally elementary weak KAM 
solutions, where c and c' are the the corresponding co-homology classes, u and u' are 
bounded on the whole M”, then 

If a{c) > a(c') then Itu',u is simply connected and unbounded, Llu,U' contains only 
one unbounded connected component; 

If a{c) = a{c') > min a, c and c' are not in one flat, then both Llu,U' o,nd Ilu',u cire 
simply connected and unbounded. 

Before the proof, let us state a lemma. A function m: [0, oo) —>■ [0, oo) is called a 
modulus if it is continuous, non-decreasing and satisfies m(0) = 0. 

Lemma 9.1. For the Hamiltonian H we assume that there is a modulus m such that 

H{y, X{x - y)) - H{x, X{x - y)) < m{X\x - + \x- y\) 

for A > 0 and x,y £ Q. Let II be an open subset o/M" and f E C{Il) satisfy f{x) < 0 
for x £ Q. Let u,v £ C(Q) satisfy 

H{x,Du)<f{x) and H{x,Dv)>0 in Q 

in the viscosity sense. If dH 0, u < v on dH, u — v is bounded in H, then u < v on 

n. 

Let u,v £ C{Il) satisfy 

H{x,Du)<0 and H{x,Dv)>0 in fl 
in the viscosity sense. If we assume further that 
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(1) there is a function cj) G (7^(0) n (7(0) such that (f < u on Q and 

sup{H {x, D(p{x)) : X € uj,u <0 Vo; CC O; 

(2) the function p —)■ H{x,p) is convex on M" for each x G O, 

then u < V on 0, provided that u — v is bounded in 0 and u < v on dit with dQ 0. 


Proof. We use the same method as used to prove Lemma 1 and Theorem 1 in [Isj and 
Theorem 2.1 in |CELj . We choose a smooth function (5 such that 0</3<l,/3(0) = l 
and /3(x) = 0 whenever ||x|| > 1. Let f5e{x) = /3(x/e). 

Since u — v is assumed bounded, there exists a smooth function cj) such that both 
u — cj) and v — f are bounded. By a translation H{x,du) —)• H{x,du + dcf) we can 
assume that both u and v are bounded in fl. 


To prove the first part of the lemma, we assume the contrary, namely, 3 xq G 0 
such that u(xo) — v(xo) > 0. Let M = max{||u||, ||u||} and introduce <h: x —)• M 
be given by 

<l>(x, x) = u{x) — v{x) + 2>Mj3e{x — x). 

Because u — u < 0 on 911 and ‘h < 2M if ||x — x'|| > e, one obtains that 
<l>(xo, xo) = u(xo) - u(xo) + 3M/3e(0) > 3M, 

and 

$|g(Qxn) < ‘h(xo,xo) if e > 0 is sufficiently small. 

However, if H is unbounded, the maximum of ‘h may not be reachable although it is 
bounded. Choose <5 > 0 very small and there exists (xi,x'^) so that 

<h(xi, x'^) > sup <h(x, x') — 6. 

Uxfl 

We choose a smooth function f: H x H —)• M such that 0 < C < 1; Cixi,x'i) = 1, 
C(x,x^) = 0 whenever ||x — xi|p + \\x' — x'^|p > 1 and \Df\ < 2 in H x H. Finally, we 
set 

T(x, x') = $(x, x') + 26C{x, x'). 

Clearly, T has a global maximum point (x, x^) G H x H. Indeed, 

'I'(xi, x)^) = $(xi, x'l) + 26 > sup d* + (5 

0x0 


whereas 

limsup 'L(x,x') < sup $ 

||3:||+||3:''||—>-cx3 QxQ 

and 

'I'(x, x')lg(QxO) < sup + (f if 5 > 0 is sufficiently small. 
0x0 


Obviously, we have 


X — x' 


Now X is a maximum point of x —)• u{x) — 
thus, by assumption 


< e. 

(u(x') — 3M/3e{x 


x') — 26C{x,x')) and 


(9.1) H{x, —3Mdj3e{x — x) — 26dxCix, x')) < /(x). 


Similarly, x' is a minimum point of x' —)> u(x') — (u(x) + 3Mj3^{x — x') + 26f{x, x')) 
and so 


(9.2) 


H{x\ —3Mdj3e{x — x') + 26dx'C{x, x')) > 0. 
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Since f{x) < 0 and e as well as 6 can be chosen arbitrarily small, (9.1) contradicts to 
(9.2), which implies that u < v on Q. This completes the proof of the first part. 

For the second part of the proof, we set 


ug{x) = 9u{x) + (1 — 0)(j){x) for x G Q, 

where 9 G (0,1). By the conditions, we can choose function / G (7(0) such that 
H{x,d(j){x)) < f{x) < 0 and f < u on Q. thus, we see that ug < u on Q and 
ug G (7(0). Note that p —)■ H{-,p) is convex, a formal calculation reveals that 


H{x, dug) < 9H{x, du) + (1 — 9)H{x, d(j)) < f. 


Applying the first part of the lemma and noting 9 can be arbitrarily close to 1, we 
complete the proof for the second part. □ 


Proof of Theorem 9.1. For each connected component of ^u',u or ^u,U'i 3 a func¬ 
tion g defined on dO. such that Uc\dQ, = Uc'\do. = 9- Thus, both Uc and Uc' are the 
viscosity solution of the Dirichlet problem 

{ H{x, du) = a, in 0 , 
u = g, on dQ 

where a is valued as a{c) and a(c') respectively. Note, dQ / 0 and Q is not necessary 
bounded. 


Clearly, the difference of U and U' is decomposed into a sum of bounded function 
and a linear function: U — U' = An -|- (c — c', x). Denote the level set oiU — U' by 

= {x G : U{x) - U'{x) = 0}, 

it is restricted a strip {|(c — c',x)| < D} for certain positive number D > 0, i.e. both 
PLu',u and contain a unbounded connected component provided c ^ d. 

If a{c) > a(c'), ^u',u is connected, otherwise there would be a connected component 
contained in the strip {|(c — d,x)\ < D}. On this connected component U — U' is 
obviously bounded, thus the lemma is violated. Thus, ^u,U' contains one unbounded 
connected component only. The possibility can not be excluded that contains 

some bounded connected components. 

For the case that a{c) = a{d) > min a, but c and d are assumed not in one flat, 3 
0 < A < 1 such that 

a{c*) < a{c), where c* = Ac -|- (1 — A)c^ 

By the result in |FSj . we know that there exists a (7^ global sub-solution of the equation 
H{x,du + c*) = a{c*), d- T"' —)• M, i.e. 0(x) = (f>{x) + {x,c*) satisfies the condition 
H{x,d(j)) — a{c*) < 0 . Since 

U — (j) = Au -|- (1 — A)(c — d, x), 

and 

U' — (f = — A(c — d, x), 

where both Au and Au' are periodic, thus U — (f> SiS well as U' — 4> can be set positive 
in the strip {|(c — c', x)| < D} by adding a suitable constant to cf- Since H is assumed 
convex in the action variable, by applying the second part of the lemma we find that 
both and are simply connected and unbounded. This completes the proof 

for the second part. □ 





ARNOLD DIFFUSION 


91 


From this theorem we see that consists of some pieces of (n — l)-dimensional 

“surface”, which may be not smooth. However, there exist some properties to describe 
their “normal” direction. The “derivative” set of U — U' at the point x is defined by 

(9.3) Du,u',x = {y - y' : y G D+U{x), y' G D^U'{x)\, 

where D^U denotes the sup-derivative of U (see Formula (A.3) in the appendix for 
the definition). Clearly, Du^u',x is closed and convex. We define 

(9.4) = {n G M" : (y,n) > ,5||n||, V y G Du,u',x], 

^u’u',x = G K'" : {y,v) < -(5||n||, V y G Du,u',x]- 

If 0 ^ both of them are not empty provided 5 > 0 is suitably small. 

Given a convex function "ip, for each direction e G MT" with ||e|| = 1, there exists a 
sub-derivative of i/; at a: such that 

tpix + te) - ip{x) = t{y^^e, e) -h 0{t^) t>0. 

Indeed, we have (y^,e) e) = deipix) where de'ip is the derivative of ip in the direction of 
e. Note each backward weak KAM solution has decomposition of a smooth function 
minus a convex function U = cp —ip. By dehnition, for xi = x + te with t > 0 we have 

(9.5) U{xi) - U{x) =(ye,xi - x) + 0{\\xi - x|p), 

<(y, xi-x) + 0(||xi - x||^), V y G D^U{x) 
where ye = d(p{x) — y^^e- Similarly, we have 

(9.6) U'{xi) - U'{x) =(ye, xi-x) + 0(||xi - xf), 

<(y', xi- x) + 0(||xi - x|p), V y' G D^U'{x) 

where y'^ = d<p'{x) — y^p'^e- Therefore, if x G and xi — x G S^'^, we subtract 

(9.6) from (9.5) and obtain 

U{xi) - U'{xi) >(ye - y',xi - x) - 0(||xi - x||^) 

><5||xi — x|| — 0(||xi — x||^). 

Thus, there exists suitably small e > 0 such that 

(9.7) U{xi) - U'{xi) >0, V xi - X G n He(x), x G 

where is a ball in centered at x with radius e. In the same way, we obtain 


(9.8) 


U{xi) - U'{xi) <0, V Xi - X G n He(x), X G Zu^u'- 


9.2. Modulus of continuity of weak KAM solutions, on positive energy level of 
systems with two degrees of freedom. 

In the this section, our attention is concentrated on the systems with two degrees of 
freedom, i.e the configuration manifold is T^. We consider the level set of a-function 

Ce = {c G : a(c) = E]. 

It is a closed and convex curve if Fi > min a. By adding a closed 1-form to the La- 
grangian, we can assume that the a-function reaches its minimal at zero cohomology. 
In the following, we shall assume this condition. In this case, Cg encircles the origin. 

Proposition 9.1. Assume that 7 and 7 ' are forward {backward) c-minimal and c'- 
minimal curves respectively, and a{c) = a(c'). Let 7 and 7 ' denote their lift to the 
universal covering space, then 7 crosses 7 ' at most once. 
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Proof. We consider forward minimal case only, i.e. they are defined on [0,oo). The 
backward case is the same. Let us assume that they cross twice, i.e. 3 ti ^ t 2 and 
Si / S 2 such that 7 (L) = for i = 1,2. Clearly, 9i = 'y{ti) — 'y'{si) 7 ^ 0. By a 

fundamental lemma of Mather in [M91| we obtain that some e > 0 and C > 0 exist 
depending on the Lagrangian only such that 


where 0 *: [—e, e] —)• is a minimal curve of L joining ^(si — e) to + e), i.e. 

ai(-e) = 7 '(si - e), aj(e) = j{ti + e) and 



L{a{t),a{t))dt 


inf [ L{C{t),C{t))dt, 

C(-e) = a(-e) J 
C(e) = a(e) 


bi'. [—e,e] —>■ is a minimal curve of L joining 7(31 — e) to + e) and A{bi) is 

defined in the same way as for A{ai). Let As = S 2 — si, At = t 2 — ti, we define two 
curves 


m = 


and 


C'{t) = 


7 W, 

t G [0,fi - e]. 

bi(t - h), 

f - fi G [-e,e]. 

I'it-h), 

f - fi G [si + e,S 2 - e]. 

a 2 (t - + As), 

t — ti As G [~e, e], 

j(t - fi + As), 

t - ti + As G [f 2 + e, 00 ), 


t G [0,si - e], 

ai(t - si). 

t - Si G [-e,e]. 

lit - si), 

f - Si G [ti + e,t 2 - e]. 

b2{t - Si + At), 

f — Si + At G [~e, e], 

i(t - Si + At), 

t — Si + Af G [s 2 + e, 00 ). 

we have 7 (fi — e) 

= iih - e), 7 (f 2 + e) = 7 


7 '(si - e) = 7 "(si - e), ^'{s 2 + e) = 7"('Si + At + e) and 


[^c(7l[ti-e,l2+6])] + [^c'(yi[si-e,S 2 +d)] “ [^c(C|[ 7 - 

6 ,si-t-At+e]}] 

^(tI [ti —e,li+e]) T ^(7 I [si—e,si+e]) ~ A[ai) — A(bl) 

+ ^(7l[l2-e,t2+e]) + ^(7l[s2-e,S2+e]) “ ^(“ 2 ) “ 

>Ce(||0if+ ||02f). 


This contradicts that fact that 7 and 7 ' are c- and cLminimal curves respectively. The 
absurdity verifies the fact that 7 crosses 7 ' at most once. □ 


Corollary 9.1. Assume that both 7 and 7 ' are forward (backward) c-minimal curves, 
dj and d'j' share the same u- or a-limit set. Let 7 and 7 ' denote their lift to the 
universal covering space, then they do not cross everywhere. 


Lemma 9.2. Assume the autonomous Lagrangian is defined on two-torus. For each 
c G Ce with E > min a, the rotation vectors of all orbits in M(c) have the same 
direction, i.e. ifd'yi,d'y 2 & A4(c), then 

(w(7i),w( 72)) = ||a;(72)||||a;(72)|| > 0. 
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Proof. We claim that for each orbit d'y: TM in the support of the minimal 

measure, we have (c — c*, [ 7 ]) 7 ^ 0 , where c* is the minimum point of the a-function, 
which is defaulted to be zero here. If it was not true, there would be a sequence of 
time Tj —>■ 00 as i —>■ 00 such that limj_^oo 7(7i) = 7 ( 0 ) and 

Y. [ Md7it))dt = Y, [ md^it)) - {c,'y{t)))dt 

i u 0 ^ t/ 0 

= —E < — min a, 

but it is absurd. As an immediately consequence, [ 7 ] 7 ^ 0 holds for each d'y in the 
support of the minimal measure. 

Next, we see that the rotation vector of one orbit in the Mather set is co-linear with 
the rotation vector of any other orbits in the set. Otherwise, they would intersect 
each other for infinitely many times. But it contradicts the Lipschitz graph property 
of Aubry set. 


Finally, we show that all of the rotation vectors have the same direction. Let us 
assume the contrary, i.e. there exist two orbits 7 ^: M —)• M {j = 1,2) such that 

[71] = -[72] 

and 


rP. 
I u 


L{d'yi{t))dt - (c, [ 7 i]) + Ti^E = 0, 


j = l,2, 


where Ti. —)■ 00 as ij —)• 00 such that limj^._,.oo = 7 j( 0 ). As (c, [ 7 *]) 7 ^ 0 for 

i = 1,2 and —[ 71 ] = [ 72 ], we can assume that (c, [ 71 ]) < 0 . As Cg is a closed and 
convex curve encircling the origin, there exist d ^ Ce and A > 0 such that d = —Ac. 
By computing the c^average action along 71 , we find 
rTu 


^ [ {L{dl{t)) - {d, [ 71 ]) + E)dt 

Ii Jo 


-‘*1 Jo 

= ^ [ {Lid'y{t)) - (c, [ 71 ]) + .& + ((! + A)c, [ 7 i]))(it 
Jil Jo 


^((1 + A)c, [ 71 ]) < 0. 
The contradiction completes the proof. 


□ 


Each energy level has a natural fibration. The fiber over a point x is denoted by 

Y,,E = {y.{x,y)£H-\E)]. 

\i E > min a, Yx,e is a smooth, convex and closed curve for each x G M. Indeed, 


because of Proposition |A.2 


there is at least one backward semi-static curve 7 “ for 
each c G Ce such that 7^(0) = x. Clearly, dxL{'y~{0),'y~(0)) G Yx,e- As E > min a, 
Ce contains at least three vertex, namely, there are at least three cohomology classes 
in such that the semi-static curves are different. Consequently, Yx,e is a closed 
curve. The smoothness and the convexity are obvious. Let 


Y'x,e = {v = dyH{x,y) : y G Yx,e}, 
it is a smooth, convex and closed curve in encircling the origin. 

Given three vectors vi, 02,03 G Yx,e C M^, we say that they are in clock-wise order, 
denoted by oi P 02 P 03 if the points H^’ ™ clock-wise order on the 

unit circle. Via the one-to-one correspondence u —)■ y = dxL{x,o), 01 , 02,03 in Yx,e 
uniquely determine three points yi,y 2 ,yo in Yx,e- As L is strictly positive definite 
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in the speed, we have yi — yx ^ 1/2 — Ux ^ Us — yx where yx is the minimal point of 
H{x,y) with fixed x. 

For each y G ^x,E, there is a unique smooth curve 'y{t,x,y): M —>■ M such that 
7 ( 0 ,x,y) = X, dxL(x,j(0,x,y)) = y. Indeed, is a trajectory of the La¬ 

grange flow. However, it is not necessary that each of these curve is semi-static. Let 

^x,E = {y ^ ^x,E ■ 7 (->a^: 2 /)lteR- = for some c G C^}, 

namely, each point y G determines a backward semi-static curve 'y{-,x,y) for 

certain cohomology class c such that 7 ( 0 , x,y) = x and dxL{'y{0, x,y),'y{0, x,y)) = y. 
It approaches to certain Aubry set in the following sense 

(^{d'yi-,x,y)) C A{c). 

Since the configuration space is two-dimensional, each orbit in a{d'y{-, x,y)) n A{c) 
has the same rotation vector, denoted by uj{x,y). Because of upper semi-continuity 
of backward semi-static curves on cohomology classes, the set Y~ ^ is closed in Yx,e- 

As the complementary set Yx,e\Y~^ is composed of open intervals in Yx,e, an 
equivalence relation ~ in Y~ ^ is introduced such that y ~ y' if y and y' are the two 
boundary points of an open interval in Yx,e\Y~ 

Proposition 9.2. If E > min a, y ^ y' in then Lo{x,y) and uj{x,y') have the 

same direction, i.e. 

{uj{x,y),uj{x,y')) = ||a;(x, y)|| ||a;(x, y')||. 

If c and d are the cohomlogy classes such that 7 (-,x,y), 7 (-,x,y') are the c-, d-semi 
static respectively, then they stay in the one flat of the a-function. 

Proof. Let us assume the contrary, namely, the direction of uj{x,y) is different from 
that of uj{x,y'). Under such assumption, the curves 'y{-,x,y)\^^^- and 7 (-, x, y')|jg]R- 
can not be semi-static for the same cohomology class, it is guaranteed by Lemma |9.2[ 

Denoted by c and d the cohomology classes such that 7 (-, y)|teR- is c-semi-static 
and j(-,x, yOlieiR- is c'-semi-static. We claim that c and d are not contained in one flat 
of the a-function under the assumption. Indeed, by the result of |Msj . A{ci) = A{c 2 ) 
whenever both ci and C 2 are in the relative interior of the flat, A{ci) D A{c 2 ) if ci is on 
the boundary of the flat while C 2 is in the relative interior. Choose c* from the relative 
interior of the flat where c, d are, then we have A{c) A A(c*) and A{d) A A{c*). But 
this places us in a dilemma: if the direction of the rotation vector for A{c*) is the 
same as that for A{c), then it is different from that for A{d), and vise verse. 

Lemma 9.3. If an autonomous Lagrangian is defined on two dimensional torus, then, 
any flat of the a-function is disjoint to any other flat. 

Proof. If there are two flats sharing a common boundary point, the derivative of the 
a-function at one flat is not colinear with the derivative at another one. Therefore, for 
the cohomology class at the common boundary point, the Mather set contains orbits 
with non colinear rotation vectors. However, it is impossible when the configuration 
space is a two dimensional torus. □ 

According to this lemma, we see that C^; is a closed and convex curve with infinitely 
many edges whenever E > min a. As is a convex and closed curve encircling the 
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origin in this case, other two cohomology classes ci,C2 G Ce exist such that 

Cl -< C -< C2 -< c' -< Cl, 


and no flat contains any two of these cohomology classes. Let loi,uj,u}2,uj' be the 
rotation vectors conjugate to ci,c, C2,c' via Fenchel-Legendre transformation 
respectively 

U! G .if^^(c) {u, c) = a(c) + 

then we have 


OJl < U < UJ2 < Oj' < OJl- 


By choose suitable ci and C2, we assume that any two of these rotation vectors are 
not colinear. Guaranteed by Proposition A .2 there exists a backward Cj-semi static 
curve 7“ (z = 1 , 2 ) such that 7~(0) = x. Denoted yi = dxL{'y~( 0 )), we claim 
that 

yi- Vx < y -Vx < y2- Vx -<y' - Vx -<yi - Vx- 


Indeed, y2 - yx < y - Vx < y' - yx, we would have 73 ( 0 )) 7(0, x, y) < 7(0, x, y'), 

because is positive definite. Let j{-,x,y), ^{■,x,y') and 72 denote the lift of 

the curves 'y{-,x,y), 7(-,x,y') and 72 to the universal covering space respectively 
such that 7(0, X,?/) = j{ 0 ,x,y') = 72(0). In this case, 72 either cross 7(-,x,y) or 
cross 7(-,x,y') at another point, because of the order oj < uj2 < oj'^ see Figure]^ But 
this violates the property that all these three curves are backward semi-static. This 
contradiction verifies our claim. However, the induced conclusion yi — yx y — yx ^ 


y) 



y 2 — yx < y' — yx < yi — yx contradicts the hypotheses that y ^ y' . It implies that 
uj{x,y) and uj{x,y') have the same direction. Consequently, c and c' are in one flat of 
the a-function. □ 

Let us consider all of those Aubry sets {Ac} such that the rotation vector u{fic) is 
non-resonant. In this case, each c-minimal measure is uniquely ergodic and the lift of 
the Aubry set to any finite covering kT^ = {(xi,X2) G : x* mod kj} still consists 
of only one Aubry class. Let u~^ be the (backward) globally elementary weak KAM 
solution for Ac- We consider the set of globally elementary weak KAM solutions 

= {C/^00 = '^^00 + {c,x) : Vc G Ce, oj{yLc) is non-resonant}, 

and let 

^i.K = {v%W ■■ uti 

Let ^ek closure of ‘^ek ™ G'^-topology. 

Theorem 9.2. Let M = and E > min a, then for each finite K > 0, the set 
^EK parameterized hy some parameter a defined in a closed interval such that 

a —)■ is ^-Holder continuous in -topology. 
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Proof. As the first step, we consider the set ‘^^K- denote the rotation vector of 

the Q-minimal measure, Uf be the globally weak KAM solution, (“Oo, 0] —)• M 

be the backward semi-static curve for the cohomology classes c* such that 7 ^^ 3 ;( 0 ) = x, 
9 iL( 7 “ 2 ,( 0 ), 7 “ 3 ,( 0 )) + Ci = dxU~ (x) whenever U~ is differentiable at x. Denoted by 

-< Uj 2 Ujg provided 


Vi = jci,x{^) consider them as vectors, we see that 
Wjj -< L 0 i 2 ^ ^* 3 ) see the proof of Proposition 9.2 


For each point x G and each q, there exists at least one backward semi-static 
curve (“00,0] —)• such that 7^1 ^.(O) = x and 7 ra{d'ypJ C Ad(ci). Let yi = 

dxL{vi, x), then y* G for 1 = 11 , 12,13 and yi^ - yx < Vh - Vx < Vh - Vx provided 

Vil P Vi2 -< Ujg . 


Given x G T^, there might be two backward c-semi-static curve originating from 
this point, denoted by 7 “ and 7 '“ with v = 7 (^( 0 ) ^ v' = 7 '“( 0 ). In this case, 
the weak KAM solution is not differentiable at this point, there are at least two 
points ycy'c £ such that yc = dxL{vc,x) and = dxL{v'^,x). The closed curve 
y^x,E is divided into two segments of arc, both have yc and y'c as their end points, 
Ix,Lu ^ I'xuj — ^x,E, where w is the associated rotation direction. We claim that either 
I'x or Ix,u} is forbidden for any ix' ^ uj m the following sense: for each y in this segment 
and for each lv' ^ oj, (x, y) does not determine any backward c'-semi-static orbit which 
approaches to the support of certain minimal measure with rotation direction oj' . If 
not, there would be a backward c'-semi-static curve 7 ^ as well as a backward c*-semi¬ 
static curve 7 ^, originating from x with associated rotation direction oj' , oj* satisfying 
the order condition oj' < oj < oj* < oj. In this case, either 7 “ or 7 '“ would cross 7 ^ 
or 7 ^ somewhere. But it is absurd. 


Let yi G Y^ ^ such that yi = dxL^j^. ^{0),x). By the notation we used before, 
yi G D+C/“(x). As yi < yj -< yk holds for each yi G D+[/“(x), yj G D+f7“(x) and 
yk G D~^U^{x), we use the notation 


(9.9) 


D+U-ix) ^ D+U-{x) ^ D+U^{x) 


to imply this relation. Different x determines different yi (might be multi-valued). 
However, the order property is independent of the position x, i.e. yi^ -< yi^ A yi^ 
holds everywhere once it holds at some point x G M, namely, the relation (9.9) is 
independent of x. 


By choosing suitable constants we assume that (xq) = 0 for each i, where the 
point Xq is pretty far away from the origin. Denoted by 


Z-. = {xGM^:C/r-C/-=0}. 

As these rotation direction are all different, any two of the cohomology classes are 
clearly not in the same flat. In virtue of Theorem 9.1, each Zp- is a continuous curve 

' ^ij 


not containing a segment encircling an bounded region. Thus it extends to infinity. 


Lemma 9.4. Assume that a(ci) = a(cj) = a(ck), the rotation vectors oJi,ojj and ojk 
are non-resonant and oji ojj oJk, then Z~. interseets Z~^ only at x = xq. 


Proof. By Theorem A.l the vertex of (x) must be on Y^ Thus, one deduces 


from the assumption oJi / ooj oj^ that the “derivative” sets (see (9.3) for the defi¬ 
nition) Pdy- u- 2 ,, Djj- jj- ^ and Djj- jj- ^ are disjoint to each other for each x G M, 


non of them contains the origin. If oJi ^ ojj -< oju, then yj — yi P yk — Vi P Uk — Uj, 
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namely 

(9.10) 


^U-,U-,x ^ ^U-,U-,x -< ^U-,U-,x- 


Consequently, each intersection point of Z- ■ with Z-, is isolated. To see it, let us note 
that 0 ^ coDjj- jj- holds for any x £ M and i ^ j. If Z~- intersects Z~j^ at some 
point x, then U~ (x) = U~ {x) = (x). Let x' G Zij be a point close to x, we obtain 


from the formula (9.7) and (9.8) that x' — x is almost orthogonal to certain vector 

y ^ ^u-,u-,x- -x,y) < S 


X — X 


holds for sufficiently small 5 > 0. Therefore, 


we obtain from (9.10) that 


z-nB.cBA(S,L„-.uS-i‘„_.) 


U-,U-,x 

7 , ’ J ’ 


u-,u-y 


Ui ,U^ ,x [/; ,Uj ,x 

provided > 0 is sufficiently small. It implies that x is the only point in B^{x) where 
Z~- intersects Z~, and the intersection is “topologically transversal”. 

Therefore, for an intersection point x of Tjj with Tj^fc, it makes sense to find another 
intersection point x' next to x if there are more than one intersection point. In this 
case, Tj^k also passes through the points x and xk Note that the intersection of these 
curves is always topologically transversal. If the “gradient” sets of these curves at x 
are in clock-wise order 

^U-,U-,x ^ ^U-,U-,x ^ ^U-,U-,x^ 

then at x' they would be in anti clock-wise order (see Figure]^ 

^U-,U-,x' ^ ^U-,U-,x' ^ ^U-,U-,x'- 



Figure 6 


However, the order property (|9.9|) is shown independent of x. The contradiction 

□ 


verifies the fact: they intersect each other only at x = xq- 


Given level sets Z-^ = 1, 2, • • •. If they intersect each other only at the origin, 

and at this point the “gradient” of — U~ are in the order 


^U-,U-,xo ^ ^U-,,U-,xo - ^U-,U-,xo 


J 3 ’ * 3 ’ 


-< 


then we say these curves are in the order 


^jl,h ^ ^j2,i2 ^ ^j3,i3 ^ 


Given two points y£, yk £ we choose other two points yo, yi £ Y^_^ ^ such 


xo,E 


that they are in the order yo ^ ye ^ Uk ^ Hi- We assume that each of these four 
points determines an backward minimal curve. Thus, we have the order: 


ye - yo < Vk - yo < Vk - ye ^ yi - ye < yi - yk- 
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By applying Lemma [9^ to the order yi — yo ^ Vk — Vo ^ Vk — ye as well as to the order 
yk — ye < yi — ye < yi — yk, we see that the order property holds 


Zn n ^ Zi,n ^ Z, n ^ Z-. u ^ Z 


^efl 


"k,0 


k,e 


1 / 


G,fc- 


Let us look this order property from another point of view. lies in the sector¬ 
shaped region bounded by the lines L^^o and Li ^ containing the curves T^ q and Li^^, 
see Figure 




Figure 7 


For any points yi, yj G ^ between y^ and yk in the following sense 

yo < ye < yi < yj <yk< yi, 

then we obtain the order property by applying the same argument 


^e,o ^ ^ ^j,i ^ ^i,j ^ 


namely, the formula (9.10) verifies the fact that Z-^ lies in the sector-shaped region 
bounded by the lines ZJ^ and and containing the curve provided U~ and U~ 
correspond to different rotation direction. Therefore, by choosing point xq suitably 
far away from the origin, Z~^ n Bk = 0- 


Therefore, this subset of K totally ordered provided each of them is valued 
zero at the point xq: U~ {xq) = 0. For any two function U~, U~ in this set we have 
either U~ (x) — U~{x) > 0 for all x G Bk, or alternatively, U~ (x) — U~{x) < 0 for 
all X G Bk- Therefore, this subset of ‘^e,k can be parameterized by “volume” in the 
following way, fix any i = i(0), other ^(ct) is defined such that 

^ = / 

J Bfc 

where the integration is in the sense of Lebesgue. The order property guarantees that 
the inverse of the map i —)• ct is well-defined. Thus, this subset of can be pa¬ 

rameterized by volume a. Notice that each non-resonant rotation vector corresponds 
to unique first cohomology class c, one obtains the dependence c = c{a) if Lo{fic) is 
non-resonant. 


With respect to a, the weak KAM solutions have ^-Holder regularity in C^-topology. 
The norm is given 

Wa - U~,\\ = max \U-{x) - C/“,(x)|. 
xGBk 


Note these functions are all Lipschitz with common constant d. Given > U^,, the 
region 


Da,a' = {(re, z) G n X M : < U^,} 
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contains at least two quarters of a cone, its height is \\\U^ — U^,\\, and the radius of 
its bottom disc is at least ^\\U~ — U~i\\, i.e. 


a — a = 


'Bk 


[u- - U-,)d^i 


vr 




-||3 


from which one obtains the ^-Holder regularity immediately. 

Since ^ can be covered by several sets {yg : 6 G 0^}, we obtain the regularity 
of the whole The regularity extends naturally to the whole □ 


As all weak KAM solutions are Lipschitz functions with bounded Lipschitz constant, 
each function in ^ek\‘^ek weak KAM solutions for c G C^; such that uj{fj,c) 

is resonant. The circle Ce contains countably many edges (flat) in general. For 
c G the c-minimal measure is supported on periodic orbit. Let Ci/, Ci^r be the 
endpoint of F^;,, and let {ck/}, {ck,r} be two sequences such that both u{ck/) and 
uj(ck^r) are non-resonant, —>■ and Ci^r as /c —)> oo, one obtains weak 

KAM solutions Uc^ ^ = limfc_,.oo U~^ ^ and 1/^ ^ = lim^^oo U~^ ^. The subscript i and 
r indicate the left and the right respectively. If 11^ (, is a backward weak KAM, then 
Uc will be a forward weak KAM, and vice versa. 


When a minimal measure is supported on periodic orbit, of which the lift divides 
into two half plane. The globally elementary weak KAM solution admits a decom¬ 
position of periodic solution and an affine function when it is restricted on half plain 
(see Proposition A.6). If the flat ¥E,i is non-trivial (not a point), these two affine 
functions are different. 


9.3. Extension of weak KAM solutions using the normal hyperbolicity. If 

such modulus continuity is established on certain normally hyperbolic manifold, we 
can extend it to the stable and unstable fibers. 

Theorem 9.3. Let x C x M" be a NHIM for the Hamiltonian flow and let 
be elementary weak KAMs defined on ¥"■. If u^\jk is 2K-Hdlder continuous in a, 
then is K-Holder continuous in the parameter a. 


Proof. By definition, certain constant C > 0 exists such that \uf —u~,\ < C\a — 

Given two parameters u, a', we claim that in each \a — in contains a point 

X where \duf{x) — du~,{x)\ < 4C'|cj — Since Lipschitz function is differentiable 
almost everywhere, if \duf — du~i\ > dCIcr — a'\^ holds for almost every point in the 
|(T — cj'l'^-ball centered at x, there exists a point x' in this ball and a path C, connecting 
these two points such that 


\{u„ {x') - u^flx)) - {u„ (x) - v(x))| 


J{du^ —du^,,dT) > 2C\ 


a — a 


.I\2k 


which contradicts the 2K-Holder continuity. 

As X is a normally hyperbolic invariant manifold, z = {duf{x),x) is a point 
in this NHIM if x lies in this A:-torus where uf is differentiable. This point has its own 
stable and unstable fiber, denoted by T^. Because the differential of backward weak 
KAM determines backward c((T)-minimal orbit, almost every point x' in a neighbor¬ 
hood of T*' uniquely determines a point {x',duf{x')) which lies on certain unstable 
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fiber with z = {du^ {x),x) and we have 


Ua ix') - (x) 



dvrr^ ), 


where ttF” denotes the projection of F” down to the conhguration space. Since the 
unstable hber is continuous in its base point, in \a — cj'|'^-neighborhood of 

x' which lies in a neighborhood of there exists a point x" such that both u~ and 
u~, are differentiable at x”, the point {x”,du~{x”)) belongs to a unstable fiber F^ 
which is \a — cr'I'^-close to another unstable fiber FJ, to which {x",du~,{x")) belongs. 
Therefore we obtain 


V(x') - u„ (x')| <\u^,{x') - {x')\ +Ci\a- cj'|'" 

<\u~,{'Kz') - u“(7r2;)| -I- (Filer - o'\^ 


+ 



dvrF^,) - 


dvrFj 


<C2|cT-f7'r 


where the hrst inequality is due to the Lipschitz property of weak KAM, the third 
one is obtained from the fact that two fibers keep |cj — cr'|'^-close to each other. □ 


Appendix A. A brief introduction to Mather theory and weak KAM 

We use the variational method to prove the main result, which is based on Mather 
theory. In this appendix, we give a brief introduction to the Mather theory and weak 
KAM theory. 

A.l. Minimizing measure and a,/! function. The theory is established for Tonelli 
Lagrangian. 

Definition A.l. Let M be a closed manifold. A -function L: TM x T —)• M is 
called Tonelli Lagrangian if it satisfies the following conditions: 

(1) Positive definiteness. For each {x,t) £ M x T, the Lagrangian function is 
strictly convex in velocity: the Hessian d±xL is positive definite. 

(2) Super-linear growth. We assume that L has fiber-wise superlinear growth: for 
each {x,t) G M x T, we have T/||x|| -£■ oo as ||x|| —)• oo. 

(3) Completeness. All solutions of the Lagrangian equations are well defined for 
the whole t G M. 

For autonomous systems, the completeness is automatically satisfied, as each orbit 
entirely stays in certain compact energy level set. 

Given a closed 1-form {ric{x),dx) with first cohomology class [{ric{x), dx)] = c, we 
introduce a Lagrange multiplier r]c = {ricix),x). Without danger of confusion, we call 
it closed 1 -form also. 

For each curve 7 : M —)■ M with period k, there is unique probability measure 
fiy on TM xT so that the following holds 

[ fdT'r = Z f fid7is),s)ds 
JtMxT «Jo 

for each / G C^{TM x T,M), where we use the notation dy = ( 7 , 7 ). Let 
io* = {pL-f I 7 G C'^(M, M) is periodic of k}. 



ARNOLD DIFFUSION 


101 


The set Sj of holonomic probability measures is the closure of io* in the vector space 
of continuous linear functionals. One can see that Sj is convex. 

For each u G the action Ac(i^) is defined as follows 

Ac(iy) = j{L- r]c) dv. 

It is proved in [M911 IMej that for each co-homology class c there exists at least one 
invariant probability measure /ic minimizing the action over 

Acidic) = inf {L - rjc) du, 

called c-minimal measure. Let i^c C be the set of c-minimal measures, the Mather 
set A4{c) is defined as 

M(c) = IJ suppfj-c- 

The a-function is defined as a(c) = —Ad/Uc) : ff^(M,R) —)> M, it is convex, finite 
everywhere with super-linear growth. Its Legendre transformation /3 : —)• R 

is called /3-function 

(3{uj) = max((a;, c) — a(c)). 

C 

It is also convex, finite everywhere with super-linear growth (see [M91| L 

Note that f Xd^^ = 0 holds for each exact 1-form A and each G io*. Therefore, 
for each measure fj. € one can define its rotation vector w(/i) G Hi(M,R) such that 

([A],a;(/i)) = j Xdn, 

holds for every closed 1-form A on M. By the following relation 
ce^fiip) a(c)+/3(p) = (c,p). 

one obtains the Fenchel-Legendre transformation Hi{M,R) —)■ H^{M,R). 

A.2. (Semi)-static curves, the Aubry set and the Mane set. The concept of 
semi-static curves is introduced by Mather and Mane (cf. [M931 IMej 1. A curve 7 : 
M —)• Af is called c-semi-static if in time-l-periodic case we have 

[^c(7)l[M']]=i^c((7(t)T), (7(0,0) 

where ^ 

[^c( 7 )l[t,i']] = ^ (^( 0 (i),i) -^c( 0 (i))) dt + a{c){t' - t), 

= inf hci{x,T),{x',r')), 

T = t mod 1 
T^ = t^mod 1 

in which 

/ic((x,r), (x',r')) = inf [Ac(OI[r,r']]- 
5eci 

i(T)-X 

In autonomous case, the period is considered as any positive number. Consequently, 
the notation of semi-static curve in this case is somehow simpler 

Oc(7)l(i,t')] = ^c(7(0,7(i')), 

Fc{x,d) = inf /ic((x, 0 ), (x',r)). 

r>0 


where 
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Convention: Let I C M be an interval (either bounded or unbounded). A continuous 
map 7 : / —>■ M is called curve. If it is differentiable, the map d'j = ( 7 , 7 ): I —)• TM 
is called orbit. When the implication is clear without danger of confusion, we use 
the same symbol to denote the graph, 7 := Ut£i{'y{t),t) is called curve and d-j := 
Ut£i{'y{t),'y{t),t) is called orbit. In autonomous system, the terminology also applies 
to the image: 7 := Ut£i'y{t) is called curve and d^ := is called orbit. 

A semi-static curve 7 G C^(M, M) is called c-static if, in addition, the relation 

K(7)l(M0] = -^c((7(O,r),(7W,r)) 
holds in time-l-periodic case and 

[^c(7)l(t.i')] = 

holds in autonomous case. An orbit X{t) = {d'y{t),tmod 27r) is called c-static (semi¬ 
static) if 7 is c-static (semi-static). We call the Mane set M{c) the union of c-semi- 
static orbits 

■^(c) = : 7 is c-semi static} 

and call the Aubry set ^(c) the union of c-static orbits 

Ac) = : 7 is c-staticj. 

We use M.{c), A{c) and Af{c) to denote the standard projection of A4(c), ^(c) and 
M{c) from TMxTtoMxT respectively. They satisfy the inclusion relation 

M{c) C A{c) C AA(c). 

It is showed in |M911 IM93) that the inverse of the projection is Lipschitz when it is 
restricted to ^(c) as well as to M.{c). By adding subscript s to A/", i.e. A/’s we denote 
its time-s-section. This principle also applies to M{c), ^(c), A4(c), A{c) and Ad(c) 
to denote their time-s-section respectively. For autonomous systems, these sets are 
defined without the time component. 

On the time-1-section of Aubry set a pseudo-metric dc is introduced by Mather in 
[M93| . its definition relies on the quantity . Let 

/i“((x,s),(x',s')) = liminf hci{x,t), {x',t')), 

s=t mod 1 
t'=s' mod 1 

For autonomous system 

(x, x') = liminf hc{{x, 0 ), {x', r)). 

T^OO 

The pseudo-metric dc on Aubry set is defined as 

dc{ix,t), {x',t')) = /i~((x,t), (x',t')) + 

With the pseudo-metric dc one defines equivalence class in Aubry set. The equivalence 
(x, t) ~ {x\ t') implies dc{{x, t), {x\ t')) = 0, with which one can define quotient Aubry 
set A{c)/ ~. Its element is called Aubry class, denoted by Ai{c) or Ac,i, its lift to 
TM X T is denoted by Ai{c). Thus, 

Ac) = IJ A(c), ^(c) = [J Ai{c). 

ieA ieA 

Although Mather constructed an example with a quotient Aubry set homeomorphic 
to an interval, it is generic that each c-minimal measure contains not more than n -|- 1 
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ergodic components if the system has n degrees of freedom |BC| . In this case, each 
Aubry set contains at most n + 1 classes. 

The definition of semi-static curve as well as of Mane set depends on which con¬ 
figuration manifold under our consideration. Let vr : M —)• M be a finite covering, a 
curve 7: M —)• M is said semi-static in M if each curve in its lift 7 is semi-static in M. 
Accordingly, we define M{c, M) {N{c, M)) as the set containing all c-semi-static orbits 
(curves) in M. We use the symbol M{c) when M is defaulted as the configuration 
manifold. 

It is possible that 7 rN'{c,M) D For instance, if C M is a open region 

such that Hi{M, N,Z) 7^ Z), J\f <Z N and the lift of in M has more than 

one connected component, then this phenomenon takes place. But we have 

Proposition A.l (Proposition 1.1 of |C12] 1. Let tt : M ^ M be a finite eovering 
spaee, then 

7rA(c,M) =A{c,M). 


A. 3. Elementary Weak KAM. The concept of c-semi-static curves can be extended 
to the curves only defined on M^, which are called forward (backward) c-semi-static 
curves, denoted by 7^ respectively. A curve 'yfi (7+) produces a backward (forward) 
semi-static orbit orbit (ilcAit))- 

Proposition A.2. If the Lagrangian L is Tonelli type, for eaeh point (x, r) G M x T, 
there is at least one 7^(t,x,T) which is forward (backward) semi-static curve. 


Since both the w-limit set of d'yfi and the a-limit set of d'y^ are in the Aubry set 
one define 


nf = 


u 

(x,r)GMxT 


x,r, 


d'yf(T,x,T) 

dt 


and call Wfi the stable set, Wfi the unstable set of the c-minimal measure respectively. 
If 7“(r, x,r) = 7+(r, x,r) holds for some (x,r) G M x T, passing through the point 
(x, r, 7Z (t, X, r)) the orbit is either in the Aubry set or homoclinic to this Aubry set. 

If the Aubry set consists of one class, the stable as well as the unstable set has its own 
generating function such that = Graph((iu^) holds almost everywhere [Ml- 
These functions are weak KAM solutions. We use to denote the weak KAM solution 
for the Lagrangian L — rjc, where rjc is a closed form with [pd = c. These functions are 
Lipschitz, thus differentiable almost everywhere. At each differentiable point (x,r), 
(x, T, dxU~{x, r)) uniquely determines backward c-semi static curve 'yf: (—00, r] —)• M 
such that 7T('l) = x, ffir) = dyH(x,T,dxU~ (x,t)). Similarly, (x,T,dxU'^(x,T)) 
uniquely determines forward c-semi static curve 'yfi: [r, 00) —>■ M such that 'jfi (r) = x, 

ixiT) = dyH{x,T,dxU+{x,T)). 

Recall the quantity h'fi‘(x,x') for autonomous system. It is a backward weak KAM 
if we think it as the function of x', a forward weak KAM for the variable of x. 

Elementary weak KAM solution is introduced when the Aubry set consists 
of finitely many classes. One Aubry class Ac,i determines a pair of elementary weak 
KAM Let x G Ac,i, one has a decomposition 


/i“(x,x') = u^ fix') - uA(x), 
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where is a backward weak KAM, we call it elementary for Ac,i- Similarly, one can 
also define uA. We introduce a barrier function for Aubry classes Ac,i and Acj 

Bc,i,j = n" - u+j. 

Elementary weak KAM solutions generate all weak KAM solutions in the sense as 
follows. 

Proposition A.3. Assume the minimal measure consists of m ergodic components. 
For each weak KAM solution , there exist m' {m' < m) constants ,d^, 

and m' open domains Df , • • • , D^, such that they do not overlap each other, M = 
Fii<i<m'Df and 

(A.l) '^^ln± — F df, VI < z < m'. 

^ i 


Proof. It is deduced from the Lipschitz property of u~ that it is differentiable almost 
every where. Let x be a point where u~ is differentiable, du~{x) determines a unique 
backward semi static orbit d'jl: (—oo,0] —)• M whose a-limit set is in certain Aubry 
class A*. By definition we have 

/■o . 

u (x) — u (7c(—i)) = J Lcid'ylfs), s)ds + a{c)t. 


Let tk ^ oo such that 7c(—tfc) —>■ x' G A*, one has 

(A.2) u~{x) = /i“(x',x) + ti+(x') = u~-{x) + di, 

since the difference of any two weak KAMs is constant when they are restricted on an 
Aubry class. 


If X* G M is another point where du~ (x* 
whose a-limit set is also contained in A*, we then obtain (A.2) for u 
same d. 


All these points constitute a set connected with A[ 
than m connected sets such that (A.l) holds. 


determines a b ackw ard semi-static orbits 

(x*) with the 
There are not more 


□ 


Globally elementary weak KAM solution is defined for the universal covering 
space of the configuration manifold. It is achieved by considering the elementary weak 
KAM solutions for arbitrary finite covering space of T”. 

Let k = (ki,..., kn) G Z"" with k* > 1, kT” = {x G M” : Xj mod kj}. Given a finite 
covering Try. kT” —>■ T”, a function Uc- ¥”■—>■ M has its natural lift Ucy. kT" —>■ M in the 
way that Uc,k{x) = Uc{7rkx)- If a finite covering of is considered to be configuration 
space, the weak KAM solution may not be 27r-periodic. The lift of elementary weak 
KAM solution may not be elementary. However, one has 

Proposition A.4. If there is only one Aubry class for the finite covering kT"’ — )• 

then the globally elementary weak KAM solution, denoted by uf^^, is the lift of the 
elementary weak KAM solution for T”; = uf^^. 

Proof. For a finite covering vr^ : kT”’ if the Aubry set still consists of only one 

class, then only one pair of weak KAM solution exists, denoted by uf^^. 

Let (ic,k) quantity of dc, h^ defined for the configuration space kT**, then 

/l“^(x, x') > A“(7rkX, TTkX') > 0 if TTkX = TT^x'. As dc{x, x') = h^^{x, x')-|-/i“^(x', x) = 0 
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if TTkX = TTkx' G Ac, it follows that h^f^(x,x') = 0. Consequently, = uf^^{x') if 

TTkX = TT^x' G Ac- 

Let L>k be a Deck transformation on kT”. For any x,x' G kT” with vr^x' G Ac, one 
has 

) =^“k(®>®0 = h'^^,{D^,x,D^,x) 

=u“k(-DkS) - u^^^^^{D^,x') 

=u-k(Dkx) -u+^(x'). 

Therefore, one has u~^^{x) = u~^^{D\^x) holds for any Deck transformation on kT”. □ 


The lift of an Aubry class Ai may contain several connected components {Aij}. It 
is closely related to whether the first relative homology of ¥"■ with respect to Ai is 
trivial. Let 

Hj = M) : 17is a neighborhood ofTl* in T*^}, 

u 

where ijj- U ^ M denotes the inclusion map. Small neighborhood U exists such that 
Hi = i;7*77i(17, M). If i7i(T’^,Z) D Hj, suitable finite covering spaces exist such that 
the lift of an Aubry class contains more than one connected component. 

For kT” with large min kj, we fix one Aubry class, denoted by A^. It is distinguished 
by fixing a point x G {x G kT" : 0 < x^ < 1} Pi Tlj such that x G which determines 
a pair of elementary weak KAM uA. The elementary weak KAM solution of other 


Aubry class in the lift is just a shift of this function uf^^{x) —>• uf^^{x + xq) 


Proposition A.5 (Lemma 2.1 of |C12j i. Given an Aubry class Ai and a bounded 
domain D C M", there exists > 0 such that for any k, k' with min{kj,k(} > Rq 
such that 


^ + constant 


where uf^, uf^^, are treated as 2kvr-periodic and 2k 'tt -periodic function respectively. 


,± 


Therefore, some function is well-defined (called globally elementary weak KAM 
solution) satisfying the condition: given any bounded domain D, there exists Rq, > 0 
such that 


■“itoolc! = "“itklc! + constant. 


provided min{kj} > R^. 


The globally elementary weak KAM solution has some special property if the Aubry 
set contains a connected component homeomorphic to T"“^. In this case, the lift of 
Ai divides M" into two parts, denoted by R~ and R''^. 


Proposition A.6 (Theorem 2.5 of |C12p . If the Aubry set contains a connected 
component contains an Aubry class Ai homeomorphic to T"“^, then the globally ele¬ 
mentary weak KAM solution uf : M" —>• M has a decomposition 


^i,oo ''^i,oo 1,001 

where is periodic and is affine when they are restricted in the half space R'^ 
as well as in another half space R~. 
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For a convex function one can define its sub-derivative. The set of sub-derivative 
of a convex function tp at x 

D~ip{x) = {y G M" : ip{x') — ip{x) > (y, x — x) V x' G M"'} 

is convex. Since each backward weak KAM solution u~ has a decomposition u~ = 
(p — pj where p is smooth and ip is convex, we define 

(A.3) D^u~{x) = {(p'{x) — y : y is a sub-derivative of ip at x}. 

Theorem A.l. Let up be a baekward weak KAM, 7 c.' (—oo,0] M is {up,Lc)- 

calibrated curve. Then, ^f(7c(0), 7c(0)) + c holds on the boundary of D^up{x). In 
particular, y is a vertex of up (x) only when there exists a calibrated curve 7 such 
thaty + c= ^( 7 c( 0 ), 7 c( 0 )). 

Proof. A curve 7; (— 00 ,0] —)• M is called {u, Lc)-calibrated if 

M(7(t')) - uplit)) = Lc{'y{s),p{s))ds -h {t' - t)a{c) 

holds for each —00 < t < 0. We say that (x,n) determines a (u“, Lc)-calibrated 
curve 7: (— 00 , 0 ] —)■ M if 7 ( 0 ) = x and 7 ( 0 ) = v. We say that v G Vup{x) C MP if 
(x,n) determines certain (n“,Lc)-calibrated curve 7: (—oo,0] —)■ M. By Proposition 
4.5.1 in |Fa2| . one has 

O T 

(A.4) up{x -I- 5h) - up{x) < 5(-^{x,v), h^ + 0(6^) 

for each (v,h) G Vup{x) x M". It implies that 

co{dxLc{x,v) : V G Fn“(x)} C D~^up{x). 

We use coS denote the convex hull of the set S. 

Adding a constant to the Lagrangian Lc ^ Lc + a{c), we assume a{c) = 0. Also, 
we only need to consider the case c = 0. Let 7: (— 00 ,0] —)> M be a backward c-semi- 
static curve, calibrated for up, with 7 ( 0 ) = x and 7 ( 0 ) = v. Let xi = 7(— (5) with 
5 > 0, then x — xi = dv + 0(6'^) if 6 is sufficiently small. Let p = 6 + v with z/ > 0, 

[—y, —5] —)> M be a curve connecting 7 (—/r) to 7 ( 0 ) such that .^(s) = 7 (^^(s -|- <5)) 

(A.5) u~{x)-u~{j{-ia)) = f L{j{s),'y{s))ds 

J 

< [ L{C{s),f{s))ds. 

J —fL 

(A. 6 ) u~{xi)-u~{'y{-n))= f L{j{s),p{s))ds, 

J 

As (5 > 0 is assumed small, by definition we obtain 
h(»)-«»)ll = 0(|i|), 
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Since 7 ( 5 ) is a solution of the Lagrange equation, dxL{'y{s),j{s)) = ■^dxL{'y{s),j{s)). 
Consequently, one has 

L(7(s),7(s)) -L(^(s),^(s)) =^(^(7('S),7(s)),7('S) 


+ 

Since j(—S) = 7 ( 0 ) — Sv + 0(|(5p), we find |u“(xi) — u~(x — (5u)| = O(S^) as weak 
KAM solution is Lipschitz function. Subtracting (A.6) from (A.5), let ly = 0(1), note 
7 (—5) = xi, ^(—5) = X and 7 (—= ^(—ju), one deduces 

u~(x - Sv) - u~(x) > f L{'y{s),^{s)) - L{^{s),i{s))ds 

J 

'dL, 




By using (A.4) one then deduces that 


O 7" 

u“(x) — u~{x — 6v) = (5^ —(x, v),v'^ + 0{6^). 

That is, dxL{x,v) is on the boundary of D~^u~{x) provided v G Vu~{x). 

To complete the proof, let us recall the concept of reachable gradient (cf. |CS| 1: 


Definition A. 2. Let u: A —)> M he locally Lipschitz. A vector p is called a reachable 
gradient of u at x G A if a sequence {x^} C A\{x} exists such that u is differentiable 
at Xk for each k G N and 


lim Xk = X, lim Du{k) = p- 

k—^oo k£oo 

The set of all reachable gradient of u at x is denoted by D*u{x). 


If M is a semi-concave function then it is proved in |CS] that 

D^u{x) = coD*u{x). 


Therefore, there exists a sequence Xfc —>■ x such that u~{xk) is differentiable and 
du~{xk) —)• y provided y is a vertex of D~^u~{x). It is well-known that u~ is differ¬ 
entiable at X if and only there is only one v G Vu~{x) such that (x,u) determines a 
unique (tt“, L)-calibrated curve 7 : (—oo,0] —)• M with 7 ( 0 ) = x and 7 ( 0 ) = v. The 
derivative of u~ at this point is a singleton 


Du (x) 


cIL 

dx 


(x,u). 


Let Xfc —>■ X be a sequence such that u~ is differentiable at each x^, corresponding 
speed is denoted by Vk- If u is the accumulation point of {vk}, by the upper semi¬ 
continuity of the set of backward (forward) semi-static curves in cohomology class, 
(x,u) determines a (m“, L)-calibrated curve. Clearly, one has y = |^(x,u), which 
completes the proof. □ 
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